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We  consider  in  this  dissertation  estimation  and  prediction  of  local  area  means 
based  on  repeated  surveys  over  time.  A  general  hierarchical  Bayes  model  is  used 
for  this  purpose.  The  resulting  HE  estimators  for  a  small  area  "borrow  strength" 
over  time  in  addition  to  borrowing  strength  from  other  local  areas.  The  Bayesian 
procedure  provides  current  as  well  as  retrospective  estimates  simultaneous  for  all  the 
small  areas.  The  HE  methodology,  in  infinite  as  well  as  finite  population  sampling 
situations,  has  been  implemented  by  adopting  a  Monto  Carlo  integration  technique- 
the  Gibbs  Sampler. 

A  multivariate  HE  model  is  considered  to  exploit  the  inter-relationship  among 
the  parameters  in  improving  the  small  area  estimators.  Both  the  univariate  and 
multivariate  methodologies  have  been  implemented  on  median  income  estimation 
problem.  The  results  show  that  the  estimators  borrowing  strength  over  time  and 
from  other  local  areas  are  substantially  better  than  the  estimators  which  borrow 
strength  only  from  other  local  areas. 


In  finite  population  sampling,  the  predictive  distribution  of  a  characteristic  of 
interest  for  the  unsampled  population  units  is  found  given  the  observations  on  the 
sampled  units  and  is  used  to  draw  inference.  In  particular,  we  have  provided  HB 
predictors  and  the  associated  standard  errors  of  the  local  area  means  over  time.  In 
a  special  case  of  this  HB  analysis,  when  the  vector  of  the  ratios  of  the  variance 
components  is  known,  the  HB  predictor  of  the  vector  of  local  area  means  is  shown  to 
possess  some  frequentist  optimality  properties.  For  example,  among  other  properties, 
we  have  shown  that  the  HB  predictors  are  best  unbiased,  best  equivariant  predictor 
under  a  suitable  group  of  transformations. 


VI 


CHAPTER  1 
INTRODUCTION 


1.1     Literature  Review 

The  existence  of  small  area  (domain)  statistics  based  either  on  a  census  or  on  other 
administrative  records  aiming  at  complete  enumeration  can  be  traced  back  to  11th 
Century  in  England  and  17th  Century  in  Canada  (see  Brackstone  (1987)). /For  the 
past  few  decades,  sample  surveys  for  most  purposes  have  taken  the  place  of  complete 
enumeration  as  means  of  data  collection.  These  surveys  have  been  able  to  provide 
sample  based  estimates  of  adequate  precision  of  the  parameters  of  interest  at  the 
national  and  subnational  levels.  However,  the  overall  sample  size  in  such  surveys  is 
usually  determined  with  the  target  of  providing  specified  accuracy  at  a  much  higher 
level  of  aggregation  than  that  of  the  small  areas.  The  resulting  small  sample  sizes  for 
the  small  areas  are  usually  insufficient  to  provide  sample  based  estimates  of  adequate 
precision  of  parameters  at  the  small  area  level  and  are  likely  to  lead  to  large  estimation 
errors.  On  the  other  hand,  the  reliability  of  the  small  area  statistics  is  crucial  for 
regional  economic  planning  and  administration.  The  evaluation  of  state  economic 
policies  and  identification  of  the  subgroups  or  regions  of  the  population  which  are  not 
keeping  up  with  average,  relies  heavily  on  the  small  area  statistics.  Furthermore,  the 
decisions  concerning  the  investments  and  the  location  of  small  industrial  units  by  the 
corporate  sector  are  usually  based  on  the  local  social,  economic,  and  environmental 
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conditions.  As  a  consequence,  there  is  a  growing  interest  in  the  development  of  reliable 
small  area  statistics. 

In  recent  years,  several  model-based  estimation  techniques  have  emerged  result- 
ing in  improved  small  area  estimates.  The  essence  of  these  small  area  estimation 
techniques  is  to  borrow  strength  from  similar  neighboring  areas  for  estimation  and 
prediction  purposes.  These  estimation  techniques  use  models,  either  explicitly  or 
implicitly,  that  "connect"  the  small  areas  through  supplementary  data  (e.g.,  census 
and  administration  data).  Early  work  of  Ericksen  (1973,  1974)  employed  the  simple 
regression  method  that  combines  sample  data  with  symptomatic  and  census  data 
for  estimating  the  population  of  local  areas.  By  adopting  a  regression  model  similar 
to  that  of  Ericksen  (1974),  Fay  and  Herriot  (1979)  suggested  a  procedure  that  used 
a  constraint  weighted  average  of  the  model-dependent  and  sample-based  estimators 
with  weights  inversely  proportional  to  the  average  lack  of  fit  of  the  regression,  and  the 
variance  of  the  sample  estimate.  These  weighted  average  estimates  were  constrained 
to  be  within  one  standard  error  of  the  sample  estimates.  Ghosh  and  Rao  (1991)  have 
recently  surveyed  most  of  the  existing  small  area  estimation  methods. 

The  simultaneous  estimation  of  the  parameters  at  a  given  time  for  several  small  ar- 
eas (strata)  by  employing  a  general  linear  model  with  covariates  have  been  considered 
by  Fay  and  Herriot  (1979),  Ericksen  and  Kadane  (1985,  1987),  Ghosh  and  Meeden 
(1986),  Ghosh  and  Lahiri  (1988),  Battese,  Barter  and  Fuller  (1988),  Prasad  and  Rao 
(1990)  using  either  a  variance  components  approach  or  an  empirical  Bayes  (EB)  ap- 
proach. An  EB  approach  does  not  incorporate  uncertainity  due  to  the  estimation  of 
prior  parameters  and  thus  leads  to  underestimation  of  true  posterior  variance.  The 
underestimation  problem  gets  more  severe  when  one  or  more  variance  components  of 
the  model  are  unknown. 

An  alternative  hierarchical  Bayes  (HB)  procedure  for  the  small  area  estimation  at 
a  given  time,  has  been  considered  in  Ghosh  and  Lahiri  (1988)  and  Datta  and  Ghosh 


(1991).  Both  the  EB  and  IIB  procedures  often  lead  to  similar  point  estimates  of  the 
parameters  of  interest.  We  may  also  add  that  in  contrast  to  the  EB  approach,  the 
HB  approach  provides  small  area  estimators  that  have  natural  measures  of  standard 
errors  associated  with  them,  namely  the  posterior  standard  deviations. 

In  the  sample  survey  literature,  the  importance  of  model-based  approach  to  small 
area  estimation  has  been  well  recognized.  However,  the  model-based  techniques  for 
small  area  estimation  considered  so  far  borrowed  strength  only  from  similar  other 
areas  to  improve  small  area  estimators. 

,  However,  often  these  small  area  (strata)  populations  are  subject  to  change  in 
time.  Usually  the  time  series  information  is  available  through  the  repeated  surveys, 
(e.g.  market  research  survey,  crop  cutting  experiments  etc.)  carried  out  at  regular 
time  intervals  by  industries  and  government  agencies.  As  mentioned  in  Scott  and 
Smith  (1974),  Indian  National  Sample  Survey,  a  multipurpose  survey  carried  out 
annually  since  1950  and  the  Medical  Date  Index  in  The  U.K.,  a  market  research 
survey  of  doctors  are  examples  of  nonoverlapping  surveys  with  human  populations. 
In  such  surveys,  one  has  at  one's  disposal  not  only  the  current  data,  but  also  data 
from  similar  past  experiments.  Typically,  only  a  few  samples  are  available  from  an 
individual  area.  '  .  ,  .,,._. 

"  In  the  past,  time  series  models  have  usefully  been  applied  to  the  analysis  of  re- 
peated survey  data  from  the  random  samples.  Early  results  in  this  area  (see,  for 
example,  Jessen  (1942),  Patterson  (1950),  Rao  and  Graham  (1964),  Raj  (1965)  and 
Singh  (1968))  have  not  assumed  any  relationship  between  successive  values  of  the 
population  mean.  Blight  and  Scott  (1973)  and  Scott  and  Smith  (1974)  have  derived 
the  estimates  for  the  mean  of  a  time  dependent  population  using  a  first-order  au- 
toregressive  model  under  the  assumption  that  all  the  parameters  of  the  model  are 
known.  Rodrigues  and  Bolfarine  (1987)  considered  the  prediction  of  the  population 


total  (in  a  finite  population)  using  a  Bayesian  approach  based  on  the  Kalman  Filter 
estimating  algorithm. 

However,  no  results  concerning  the  estimation  of  the  parameters  for  the  small 
areas  under  the  dynamic  superpopulation  models  wherein  past  information  can  be 
effectively  used  in  conjunction  with  the  current  information,  are  available. 

-      1.2     The  Subject  of  This  Dissertation 

In  this  dissertation,  we  present  a  Bayesian  analysis  of  the  small  domain  data  in 
repeated  surveys.  Estimation  and  prediction  methodology  have  been  developed  for 
the  infinite  as  well  as  finite  population  sampling  situations.  We  consider  simultaneous 
estimation  of  several  strata  means  through  a  time  series  Bayesian  estimation  model. 

In  Chapter  2,  we  introduce  a  HB  model  for  simultaneous  estimation  of  several 
small  areas  means  in  infinite  population  sampling,  assuming  that  the  observations  are 
available  only  at  the  aggregate  level.  The  posterior  distribution  is  provided  through  a 
fully  Bayesian  procedure  which  uses  a  Monte  Carlo  numerical  integration  technique, 
the  Gibbs  sampling  algorithm.  The  resulting  small  area  estimators  "borrow  strength" 
over  time  in  addition  to  borrowing  strength  from  other  local  areas.  The  methodology 
developed  is  applied  to  the  problem  of  estimation  of  median  income  of  four-person 
families  by  states. 

In  Chapter  3,  we  present  a  multivariate  HB  model  for  simultaneous  estimation 
of  several  small  areas  means  in  an  infinite  population  sampling,  once  again  assuming 
that  the  observations  are  available  only  at  the  aggregate  level.  Our  aim  is  to  exploit 
the  inter-relationship  (if  any)  among  the  parameters  in  improving  the  small  area 
estimators.  We  have  also  implemented  the  proposed  analysis  on  the  median  income 
estimation  problem  considered  in  the  previous  chapter  through  the  use  of  median 
incomes  of  three-  and  five-person  families.    The  multivariate  analysis  in  this  case 


shows  definite  improvement  in  the  median  income  estimates  when  compared  to  the 
univariate  analysis  in  Chapter  2. 

Chapter  4  addresses  the  simultaneous  estimation  of  several  strata  means  at  dif- 
ferent time  points,  where  each  stratum  at  each  time  point  contains  a  finite  number  of 
elements.  We  assume  that  the  observations  are  available  at  the  unit  level.  The  HB 
model  considered  in  this  chapter  can  be  regarded  as  an  extension  of  the  work  of  Datta 
and  Ghosh  (1991)  who  used  general  linear  models  for  small  area  estimation.  While 
the  HB  predictors  of  Datta  and  Ghosh  (1991)  utiHze  the  data  available  at  a  given 
time,  our  HB  predictors  for  small  areas  utilize  the  data  available  at  a  given  time, 
along  with  all  the  data  available  at  previous  time  points.  Also,  in  addition  to  the 
current  small  area  estimates,  our  model  provides  the  retrospective  estimates  of  the 
small  areas  at  all  the  previous  time  points.  This  chapter  also  considers  a  special  case 
of  the  time  domain  HB  model  for  the  finite  population  sampling,  assuming  known 
variance  ratios.  In  such  a  set-up,  the  HB  predictors  have  been  shown  to  possess 
certain  optimal  properties.  It  has  been  shown  that  HB  predictors  are  best  unbiased 
predictors  within  the  class  of  all  unbiased  predictors.  Under  milder  assumptions, 
these  predictors  are  best  unbiased  linear  predictors.  The  stochastic  dominance  (cf. 
Hwang,  1985)  of  these  predictors  over  the  linear  unbiased  predictors  for  elliptically 
symmetric  distributions  have  also  been  established  in  this  chapter.  Finally,  we  have 
also  shown  that  these  predictors  are  also  best  equivariant  predictors  for  both  the 
matrix  loss  as  well  as  general  quadratic  loss,  under  suitable  group  of  transformations 
for  a  class  of  elliptically  symmetric  distributions.  .., 


CHAPTER  2      , 
ESTIMATION  OF  SMALL  AREA  MEANS  IN  TIME  DOMAIN 


2.1     Introduction 

This  chapter  considers  the  problem  of  small  domain  estimation  based  on  data 
from  repeated  surveys.  Estimation  of  the  characteristic  of  interest,  such  as  population 
mean,  change  in  mean  over  two  distinct  time  points,  or  other  parameters  of  interest  is 
considered  by  using  a  hierarchical  Bayes  (HB)  approach.  Such  models,  in  contrast  to 
the  ones  considered  in  Chapter  4,  will  be  appropriate  when  observations  are  available 
at  an  aggregate  level,  rather  than  at  a  unit  level.  The  previous  work  of  Datta, 
Fay  and  Ghosh  (1991)  used  a  HB  approach  for  analyzing  such  aggregate  data  for 
several  small  areas  at  a  fixed  point  of  time.  Also,  the  empirical  and  subjective  Bayes 
approaches  considered  earlier  by  Fay  and  Herriot  (1979),  Ericksen  and  Kadane  (1985, 
1987),  Ericksen,  Kadane  and  Tukey  (1989)  involved  similar  analysis  of  small  domain 
data  where  no  time  series  model  was  used.  One  common  theme  in  all  these  papers 
was  to  "borrow  strength"  from  similar  other  areas  to  obtain  improved  small  area 
estimators.  The  present  study  borrows  strength  over  time  as  well  in  addition  to 
borrowing  strength  from  other  local  areas.  ': 

On  another  front,  Bayesian  analysis  of  data  from  repeated  surveys  is  also  available 
in  the  literature  which  does  not  address  the  problem  of  simultaneous  estimation  from 
several  small  areas.  Mention  may  be  made  in  this  context  of  the  work  of  Blight  and 
Scott  (1972),  Scott  and  Smith  (1974)  who  considered  estimation  procedures  from  a 
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population  whose  mean  varied  over  time,  and  applied  standard  normal  time  series 
methods.  These  results  were  later  generalized  by  Rodrigues  and  Bolfarine  (1987) 
who  used  a  dynamic  linear  model,  and  used  Kalman  filter  estimating  algorithm. 

Our  model  encompasses  all  these  models  considered  earlier  as  special  cases  when  the  | 

number  of  local  areas  is  one  and  the  variance  components,  or  at  least  the  variance  =! 

ratios  are  known.  ; 

In  addition,  the  model  that  we  consider  in  Section  2.2  has  several  novel  features.  ] 

Most  of  the  models  considered  earlier  for  time  series  analysis  of  survey  data  eissumed  j 

known  variance  components.   The  present  work  does  not  need  such  an  assumption.  i 

Instead,  the  HB  method  assigns  distributions  (often  non-informative)  to  these  vari-  | 

i 

ance  components,  and  thereby,  in  effect  models  the  uncertainity  due  to  unknown  1 

"I 

prior  parameters.   In  consequence,  the  usual  Kalman  filter  updating  algorithm  does  , 

i 

not  work  any  more,  but  the  problem  is  addressed  in  a  more  realistic  manner  without  i 

invoking  the  assumption  of  known  variances,  or  known  ratios  of  variances.  Also,  the  ^ 
computational  problems  are  overcome  by  using  Monto-Corlo  integration  techniques 

such  as  Gibbs  Sampling  (see  Gelfand  and  Smith  (1990)).  j 

Our  model  also  bears  some  analogy  to  the  one  considered  in  Sallas  and  Harville  ^ 

(1988)  when  the  number  of  local  areas  is  one.   However,  while  the  Sallas  -  Harville  .] 

approach  is  primarily  an  empirical  Bayes  (EB)  approach  ,  ours  is  a  HB  approach.  -l 

The  advantage  of  the  present  approach  over  an  EB  approach  is  that  while  the  latter  i 

does  not  incorporate  uncertainity  due  to  estimation  of  the  prior  parameters,  and  as  l 
such  typically  underestimates  the  standard  errors  that  need  to  be  reported  with  the 
EB  estimates,  the  former  produces  more  honest  estimates  of  the  standard  errors. 

The  outline  of  the  remaining  sections  is  as  follows.  Section  2  describes  the  basic  1 
hierarchical  model,  describes  the  Gibbs  sampling  algorithm,  and  computes  the  nec- 
essary posterior  distributions  needed  to  implement  the  Gibbs  sampling  procedure. 
This  section  also  provides  formulas  for  the  posterior  mean  and  variance  via  Gibbs 


8 

sampling.  An  alternative  EB  procedure  is  described  in  Section  3.  The  methodology 
developed  in  Section  2  is  illustrated  with  the  problem  of  estimation  of  median  income 
of  four-person  families  by  states  in  Section  4. 


i 


I 


2.2     Description  of  the  Hierarchical  Bayes  Model 


Let  Yij  denote  the  standard  sample  survey  estimate  of  some  characteristic  dij  such 
as  the  population  mean  based  on  a  given  sample  at  time  j  for  the  i"*  small  area 
{i  =  1,  •  •  • ,  m;  j  =  1,  •  •  • ,  i).  Our  objective  is  to  optimally  estimate  a  linear  function 
Hill  Hj=i  UiQiji  using  all  the  data  available  upto  time  t.  In  particular,  we  may 
be  interested  in  estimating  current  population  means  for  all  the  small  areas  based 
on  several  sets  of  sample  means  available  over  time.  To  this  end,  we  consider  the 
following  HB  model: 

II.    %|a,6;,V'.~iV(a:?;a  +  z^6„0,)(i  =  l,---,m;i  =  l,...,O; 

III.    6,|6,_i,TF^iV(i/,6,_i,  W)(;  =  1,...,0; 
IV.     Marginally  a,  0i,  •  •  • ,  ;/.'<,  and  W  are  mutually  independent  with 
a  ^  Uniform{RF), 
Rj  (=  ip~^) -^  Gamma  i^aj,  ^ij), 
W~   inverseWishart  {S,k).  • 

In  (II),  Xij  and  Zij  are  known  design  vectors  of  size  p  and  q  respectively. 
We  say  that  a  random  variable  R  has  a  Gamma  (a,  6)  distribution  if  it  has  a  pdf 
of  the  form 

/(r)  oc  exp{--ar)r2'>-'^I^o,oo){r)■,  a  >  0,  6  >  0, 
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where  /  denotes  the  usual  indicator  function.  '  .  .,  ,  >; 

We  say  that  a,  q  x  q,  positive  definite  random  matrix  W  has  an  inverse  Wishart 
distribution  with  parameters  S  and  k  if  it  has  a  pdf  of  the  form 

f{w)  oc  |Ty|-<'+'=/2)exp   -^trace  [kSW-') 

where  A;  >  0  is  a  known,  scalar  degrees  of  freedom  parameter,  and  S  a.  q  x  q  positive 
definite  matrix.  This  means  that  W~^  has  a  Wishart  distribution  with  degrees  of 
freedom  k  and  scale  parameter  S. 

In  (IV),  we  shall  allow  the  possibility  of  diffuse  priors  for  Rj  or  W,  e.g.  Qj  = 
0,i,  =  0,  for   (i  =  l,...,i),  5  =  0. 

In  the  case  when  t  =  1,  the  suffix  j  can  be  omitted  and  Yij  =  V^  is  the  usual 
survey  estimate  of  the  small  area  parameter  ^,.  Write  xpj  =  tj;,  aj.v,  =  Xi.  In  such  a 
case,  we  can  write  the  model  I-III  as 

A.  Yi\er^'N{ei,  V,)- 

B.    0,\w'^^N{xf(x,xl;  +  zfwzi) 

Special  cases  of  this  model  are  considered  in  Fay  and  Herriot  (1979)  and  Ericksen  and 
Kadane  (1985,  1987)  where  z,  =0. 

In  the  case  when  m  =  1,  the  suffix  i  can  be  omitted  and  Y,j  =  Yj  is  the  usual 
survey  estimate  parameter  Oj  at  time  j.  In  such  a  case,  our  model  in  (I-III)  can 
be  compared  with  some  of  the  models  advocated  for  the  analysis  of  repeated  survey 
data.  To  see  this,  it  is  convenient  to  write  (ITII)  as:     . 

(i)    Y,=xJa  +  b,  +  e„ej^N{0,Vj),  ■  '     . 

(ii)    bj  =  Hjb,_i+u,,  Lj,^  N{0,W) 

We  first  consider  the  time  series  model  of  Scott  and  Smith  (1974)  for  analysis  of 
repeated  surveys.    The  special  case  of  their  model  when  a  first-order  autoregressive 
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process  is  assumed  for  the  characteristic  of  interest  9j  (which  in  this  setup  corresponds 
to  bj)  can  be  identified  with  (i)  and  (ii). 

In  general  stages  I  and  II  of  the  model  can  be  combined  into  a  single  stage 

Yij  =  xja  +  zjjbj  +  eij,  where  eij  ~  N{Q,  Vij  +  ^j)  - 

Our  model  bears  resemblance  to  the  one  considered  in  Sallas  and  Harville  (1988). 
However,  there  is  a  slight  difference  in  the  variance  structure.  To  see  this,  combine 
stages  I  and  II  of  the  model.  This  leads  to 


Yij  =  xJa  +  zlhj  +  dj, 


xnd 


where  e,-,  ~  iV(0,  K,,  +  0^).  In  contrast,  in  Sallas  and  Harville  (1988),  V{ej)  = 
a  Rj,  Rj  known,  cr'^  possibly  unknown,  where  ej  =  (eij,  ■  •  • ,  Cmj)^.  4 

We  emphasize  once  again  that  the  HB  structure  considered  in  this  chapter  assigns  1 

distributions  to  the  parameters  a,  0i,  •  •  • ,  ^^  and  W.  To  our  knowledge,  the  problem  '} 

1 
has  not  been  addressed  to  this  level  of  generality  before  even  within  a  Bayesian  ^ 

i 
framework.  J 

We  now  describe  Gibbs  sampling  algorithm  in  its  general  framework.  The  Gibbs 
sampling  is  a  Markovian  updating  scheme  which  requires  sampling  from  full  condi- 
tional distributions.  Densities  are  denoted  generally,  by  square  brackets  so  that  the 
joint,  conditional  and  marginal  densities  appear  as  [C^  V],  [t/'lF],  [F],  etc.    Given  a  j 

collection  of  random  variables  (real  or  vector  valued)  Ui,---,Uk,  the  joint  density 
[^1,  •  •  • ,  Uk]  is  uniquely  determined  hy  [Us\Ur,r  ^  s],s  =  1,- ■  ■  ,k.  under  certain  mild 
conditions  (see  Beseg  1974,  pp  194-195).     The  interest  is  in  finding  the  marginal  | 

distributions  [£/,],  s  =  1,- ••,  A;.  -  *       •'         '"* 

■j 
Gibbs  sampling  algorithm  is  as  follows:  given  an  arbitrary  set  of  starting  values,  *| 

Ui°\---,ui'\     draw     Ui'^     from     [/7i|C/f\  •  •  • ,  ^ri-  Then     draw     t/f'     from 

[U2\Ui'\ui''\---,Ui%---,Ui'^  from  [Uk\Ul'\---,Ui'],]  to  complete  one  iteration 
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of  the  scheme.  After  t  such  iterations,  arrive  at  (t/('\  •  •  • ,  u]^^).  There  are  two  alter- 

kg. 


native  ways  to  obtain  i.i.d.   k  -  tuples  {U[^^ ,  •  •  • ,  U\^h,{g  =  1,  •  •  • ,  G)  from  the  joint 


distribution: 

•  Parallel:  Replicate  the  entire  process  of  t  iterations  in  parallel  G  times  storing 
only  the  last  {t'^)  iterate  to  provide  {U['^ ,■  ■  ■  ,U'^^),  g  =  I,- ■  ■  ,G. 

•  Sequential:  Employ  only  one  single  long  run  oi  t  x  G  Gibbs  iterates,  storing 
every  <"*  iterate  to  provide  (t/(^',  ■  •  • ,  U^^),  ^  =  1,  •  •  ■ ,  G.    ; 

Gelfand  and  Smith  (1990)  adopted  the  parallel  algorithm  in  simulating  samples 
from  posterior  distributions.  These  simulated  sample  observations  can  then  be  used 
for  estimation  of  any  of  the  marginal  densities.  In  particular,  assuming  that  the 
conditional  density  f{U,\Ur,r  ^  s)  is  available  in  closed  form,  a  Rao-Blackwellized 
sample-based  estimate  of  the  marginal  density  of  U,  (see  Gelfand  and  Smith  (1991)) 
is  given  by    .      .  .  .  ,  ■,  . 


m.)  =  -^'Enu.ius,r^s).  ^        .^-^ 


3=1 


i 

1 

We  will  also  be  employing  the  parallel  implementation  of  the  Gibbs  sampling  algo-  ] 

rithm  in  our  analysis  of  the  time  domain  HB  model,  { 

In  order  to  implement  the  Gibbs  sampling  algorithm  we  require  samples  from  the  'i 

following  complete  conditional  distributions:  I 

..'■•■  -'ill 

- ,     -  i 

-■■;;■  ■•     '  I 

•  6,|y,^,a,6i,--,6,_i,6,+i,...,6j,ri,..-,rt,Ty,  j  =  l,...,f 

•  ri,--,ri|y,0,6i,  ••■,6<,PF 


•  ^.jly,  61,  •••,6«,ri,  •••,r<,Pr,  (?•  =  !,...,  m;i  =  1,...,0  j 

"  .  '  •  J 

The  hierarchical  structure  of  the  model  imphes  that  .  | 

■  ,  ■  ..  -.^ 

"  ¥\         (i)        oc\y,e,bu--,bt,ri,--,rt,W  ^a\e,bu--,buri,--,rt  | 

{iia)    bj\y,e,a,bj,^j,ri,-  ■  ■  ,rt,W  r^  bj\e,a,bj-i,bj+i,rj,W 

{j  =  l,...,t-l) 
{lib)     bt\y,e,a,b,,^„rr,--,rt,W  ^bj\d,cx,bt.urt,W 
(in)     rj\y,e,a,bi,---,bt,W^rj\a,e,bi,---,btiJ  =  l,...,t)     ,      .. 
{iv)      W\y,e,bi,--,buru--,rtr^W\bu--,bt 
(v)       ^i:\y,bu---,bt,ri,---,rt,W  ^eij\yij,rj,b,,a  ■: 

(i  =  l,...,m;j  =  l,...,f)  .1 

Formal  derivation  of  the  complete  conditional  distributions  is  now  straightforward 

and  is  omitted.  The  complete  conditional  distributions  are  given  by  J 

i 

■■■■'  ■  ■  •  ■■■  ■■  '^    ■.  "   ■  i 

(lia)      For   (i  =  1,  •..,<- 1)  :  ■    -  j| 

bj\y,e,a,bi,--,bj_ubj+u--^bt,ru--,rt,W 

x{{ZT=izJ^z,j)-'EZ,zf^{0,,-xf^c.) 

+  Hj^,W-'bj+^  +  W-'H,b,.r}, 


r^.7= 
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{iib)    bt\y,e.a,bu--,buru--,rt,W 
(m)    ri,---,r,iy,d,6i,...,6,,-W^ 


~    Gamma(l(a,  +  E:^iE;=i^u-a;.,«-z.,6,)2,  l{h,  +  m)) 


\  ^ 


M     W\y,e,b,,--,bun,--,rt 

~  Inverse  Wishart(S  +  E'=i(6,  -  Hjbj_,){bj  -  Hjb,_^f,  k  +  t) 

Our  objective  is  to  find  the  posterior  expectation  and  variance  of  6ij  given  y.  A 
Rao-BIackwellized  estimate  for  E{Oij\y)  based  on  simulated  samples  from  the  above 
conditional  distributions  can  be  approximated  by 

^  E  ^  (%ly, "  -  «i^  W  =  W^(0,  6,  =  6,„  r,  =  r,„i  =  1,  ■ . . ,  i) 

^  3=1  ^ 

The  posterior  variance  can  be  approximated  by 

5^  [E^e,,\y,  oc  =  a('),  W  =  Wf,  6,  =  6,„  r,  =  r,„;  =  1,  •  •  • ,  i) 

-^  (E?=i  ^  Kly, «  =  aj^  W^  =  Wf,  b,  =  6,„  r,  =  r,„  J  =  1,  •  •  • ,  0)' 


rffrii»T-v 
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2.3     Empirical  Bayes  Estimation 


t  •" 


In  an  EB  formulation,  we  assume  (I-III)  of  the  model,  but  not  (IV).  Also,  it  is 
assumed  that  W  is  a  known  matrix.  In  such  a  case,  we  may  write  the  model  as 


ind 


Kil%    -"   iV(%,K-,) 


>  -, .  I    » 


ind 


i-1 


%     ~     A^    xja,  0,  +  zJ(W  +  X:i/,HjK- 


/=i 


assuming  known  60,  i.e.,  60  =  0  without  any  loss  of  generality. 
It  follows  from  2.3.1  that 


'«j 


'ij 


N 


where 


xtcx  )  '  1^  A,-  Ai 


J-1 


From  (2.3.2),  the  posterior  distribution  of  %  given  Yij  can  be  obtained  as 


(2.3.1) 


(2.3.2) 


(2.3.3) 


%|y;-,  '^  N  [xja  +  A,(A,  +  V,,)-\Yi,  -  xj^cx),  A,Ki(A,  +  V,,)-']     (2.3.4) 
The  Bayes  estimator  of  dij  under  any  quadratic  loss  is  its  posterior  mean,  and  is  given 

by  •  ■: 


^J  =  xf^<x  +  A,(A,  +  V,jr\Y,,  -  xlcx) 


(2.3.5) 


which  may  also  be  seen  as  a  weighted  average  of  the  survey  estimate  Y,j  and  xJjOi. 

In  an  EB  analysis,  at  least  one  of  the  parameters  a  and  ipj,   {j  =  I,---,i);  is 
unknown  and  needs  to  be  estimated  from  the  marginal  distribution  of  the  Kj's. 

First  we  estimate  a.  Marginally 
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Y,,  '^'  N  {xla,  Ai  +  V;-,) 


Using  the  method  of  maximum  likelihood,  o:  can  be  estimated  by 


a  = 


m       t 


j:E^':^i/iDij+v,j) 


1=1 j=i 


-1 


m       t 


EE^;^./(A,+Ki) 


i=l  J  =  l 


(2.3.6) 


An  EB  estimator  of  6ij  is  obtained  by  substituting  d  for  a  in  (2.3.5)  and  is  given  by 


lEB 


ef^"  =  xija  +  Di,{Dii  +  Vii)-\Y^  -  xi^cc) 


(2.3.7) 


The  EB  approach  now  attempts  to  estimate  ipj  by  maximum  likelihood  method  or 
some  other  standard  method.  For  example,  noting  that 


^(v;-,-  -  xJd)V(A,  +  K-,) 


1=1 


=  m-q, 


one  can  solve  iteratively 


X:(>^,-x;d)V(A,  +  K,)  =  m-9 


(2.3.8) 


i=l 


in  conjunction  with  (2.3.6),  to  find  V'j  if  a  positive  solution  for  V'j  is  found  and  letting 
%l^j  =  0,  when  no  positive  solution  could  be  found. 

We  observe  that  the  EB  approach  and  the  HB  approach  (considered  in  Section  2.2) 
will  lead  to  the  same  point  estimator  of  6ij.  However,  the  EB  procedure  by  ignoring 
the  uncertainity  in  estimating  the  unknown  parameters  will  lead  to  the  underestima- 
tion of  the  true  posterior  standard  errors.  The  complete  HB  analysis  though  involved, 
provide  more  reliable  estimates  of  standard  errors.  It  is  for  this  reason,  we  will  be 
employing  only  the  HB  approach  in  the  illustrative  example  considered  in  Section 


2.4. 
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2.4     An  Application  of  Hierarchical  Bayes  Analysis 

In  this  section,  we  apply  the  HB  procedure  developed  in  Section  2.2.  to  an  actual 
data  set.  Our  application  concerns  estimation  of  median  income  for  four-person 
families  by  state.  Before  employing  the  HB  model  for  median  income  estimation,  we 
will  briefly  give  a  background  of  this  problem.  The  details  are  given  in  Fay  (1987). 

The  U.S.  Department  of  Health  and  Human  Services  (HHS)  administers  a  program 
of  energy  assistance  to  low-income  families.  Eligibility  for  the  program  is  determined 
by  a  formula  where  the  most  important  variable  is  an  estimate  of  the  current  median 
income  for  four-person  families  by  states  (the  50  states  and  the  District  of  Columbia). 

The  Bureau  of  the  Census,  by  an  informal  agreement,  has  provided  estimates  of 
the  median  income  for  the  four-person  families  by  states  to  HHS  through  a  linear 
regression  methodology  since  the  latter  part  of  the  1970s.  Two  basic  approaches  have 
been  used  with  change  in  methodology  occurring  for  income  year  1985.  In  this  earlier 
method,  sample  estimates  Y  of  the  state  medians  for  the  most  current  year  c,  as 
well  as  the  associated  standard  errors  are  first  obtained  through  Current  Population 
Survey  (CPS).  The  estimates  Y  were  used  as.  dependent  variables  in  a  linear  regression 
procedure  with  the  single  predictor  variable  '  *  ' 

Adjusted  census  median(c)  =     [BEA  PCI(c)/BEA  PCI(6)] 

(2.4.9) 
.  X  Census  median(6), 

where  census  median  (6)  represents  the  median  income  of  the  four-person  families  in 
the  state  for  the  base  year  b  from  the  most  recently  available  decennial  census.  In  the 
above  BEA  PCI(c)  and  BEA  PCI(6)  represent  respectively  estimates  of  per-capita 
income  produced  by  the  Bureau  of  Economic  Analysis  of  the  U.S.  Department  of 
Commerce  for  the  current  year  c,  and  the  base-year  b,  respectively  .  Thus  (2.4.9) 
attempts  to  adjust  the  base  year  census  median  by  the  proportional  growth  in  the 
BEA  PCI  to  arrive  at  the  current  year  adjusted  median.  Moreover,  in  developing  the 


p!"!Si 
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model,  the  states  were  divided  on  the  basis  of  the  population  into  four  groups  of  12 
or  13  states  each.  Model  variances  for  each  were  computed  as  the  average  of  squared 
residuals  of  1969  census  medians  fitted  by  a  linear  regression  with  an  intercept  term 
along  with  the  adjusted  census  median  for  1969  (as  given  in  (2.4.9))  as  a  covariate.  j 

Next  a  weighted  average  of  the  CPS  sample  estimate  of  the  current  median  income 
and  the  regression  estimates  is  obtained,  weighting  the  regression  estimate  inversely 
proportional  to  the  model  error  due  to  the  lack  of  fit  for  the  census  values  of  1969 
by  the  model  with  1959  used  as  the  base  year,  and  weighting  the  sample  estimate 
inversely  proportional  to  the  sampling  variance.  From  a  Bayesian  angle,  the  above 
composite  estimate  is  obtainable  from  the  simple  hierarchical  model  of  Lindley  and  ^ 

Smith  (1972).  Finally,  the  above  composite  estimate  was  constrained  not  to  deviate  ! 

by  more  than  one  standard  deviation  away  from  the  corresponding  sample  estimates.  i 

Thus  the  final  estimates  were  analogous  to  the  "Limited  Translation  Estimates"  as  J 

discussed  for  example  in  Efron  and  Morris  (1971)  or  Fay  and  Herriot  (1979).  ^ 

The  model  was  reviewed  once  the  median  income  of  four-families  by  state  in  1979  i 

became  available  from  the  1980  census.  The  following  modifications  to  the  previous  1 

modelling  were  suggested  on  the  basis  of  this  review.  1 

First,  it  was  decided  to  consider  a  single  level  of  model  error  across  all  the  states. 
The  reason  was  that  the  differences  between  the  estimated  model  errors  for  the  larger 
and  smaller  states  greatly  exaggerated  more  current  differences.  Second,  the  predic-  ] 

tive  value  of  the  regression  appeared  usefully  improved  if  in  addition  to  the  adjusted  j 

-i 

census  median  (c),  the  census  median  (b)  was  also  included  as  a  second  indepen-  1 

■i 
dent  variable.    The  idea  was  to  adjust  for  any  possible  overstatement  of  the  effect  j 

of  change  in  BEA  income  upon  the  median  incomes  of  four-person  families.    Also,  jj 

the  one  standard  deviation  constraint  was  removed.  This  matter  is  discussed  in  Fay  ^ 

(1987). 
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So  far,  the  entire  time  series  of  the  sample  median  income  estimates  available 
for  each  year  from  the  preceding  census  have  not  been  used  to  obtain  the  median 
income  estimates  for  the  four-person  families.  Our  model  in  Section  2.2  exploits 
the  past  time  series  information  available  at  a  given  time  in  addition  to  the  recent 
most  census  medians.  Before  we  describe  the  hierarchical  model  for  median  income 
estimation,  we  need  to  introduce  a  few  notations.  Let  Yij  denote  the  median  income 
of  four-person  families  in  state  i.  The  true  median  corresponding  to  Yij  is  denoted  by 
Oij.  Let  aj.ji  and  Xij2  denote  respectively  the  adjusted  census  median  income  for  the 
year  j  (as  given  in  (2.4.9)),  and  the  base-year  census  median  income  for  four-person 
families  in  the  i*''  state  (z  =  1,  •  •  • ,  51).  Finally,  let 

xJ  =  (la:i,iX,-,2),  (z  =  l,-..,51;  j  =  l,---,10),  (2.4.10) 

and  denote  by  a  =  (ai  Q2  aj)^  the  vector  of  regression  coefficients.   The  following 
hierarchical  model  is  used  to  estimate  the  ^,j's: 

L         Yij\9ij,Vij   are  mutually  independent  with 

Yij'^'N{0ij,Vi,),{t^l,-..,51;j  =  l,---,lO) 
II.      ^.,|a,6„V/~iV(a;^a  +  6„  0,),  (^  =  1,  •  •  •  ,51;  ;  =  1,  •  •  • ,  10) 
IIL    b,\b,_uT''^'N{b,.^,T')U  =  l,...,10)         .- 
IV.      Marginally  a,  ?/'i,  •  •  • ,  0io,  and  r^  are  mutually  independent  with 
a  ~  uniform{R^), 

rj  =  t/'7^  ~  GammaiO,  -1),  (j  =  1,  •  •  • ,  10) 
■     ro  =  {t'^)~^ '^  Gamma{0,0), 

To  find  the  estimates  of  the  median  income  of  four-person  families  and  their  standard 
errors,  we  need  to  use  the  Gibbs  sampling  algorithm  and  require  samples  from  the 
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following  complete  conditional  distributions: 


(. 


•  a|y,d,6i,---,6io,ri,--,rio,ro  ! 

i 


■  •  ^j|y,^.«,&i,---,&j-i,ii+i,---,6io,ri,---,rio,ro,  i  =  1,...,10 


'''.'  ■■     .♦..-.  .-  -.j- 


•  n,---,rio|y, 0,61, •••,610,  ■     •'  -"^   ■■■"    ^ 

•  ^o|y,^,ii,---,6io,ri,---,rio 

•  0ij\y,  ii,  •  •  • ,  610,  ri,  •  •  • ,  r-io,  ro,  (f  =  1, . . . ,  51;  i  =  1, . . . ,  10) 
Again,  the  hierarchical  structure  of  the  model  implies  that 

(i)        oc\y,9,bi,---,bio,ri,---  ,rio,ro  ~  Q;|^,  61,  •  •  • ,  6io,ri,  •  •  •  ,rio 
{iia)    bj\y,9,a,bj_i,bj+i,rj,ro  ~  bj\e,a,bj.i,bj+urj,ro 

(;  =  i,--.,9) 

(iib)     biQ\y,e,a,bi,---,bg,ru---,rio,ro  ~  6io|^,a,69,rio,ro 
(iii)     rj|y,^,a,6i,---,6io,ro~  rj\a,e,bi,- ■  •  ,bio  {j  =  1,...,10) 
(iv)      ro|y,0,6i,---,6io,ri,---,rio~ro|6i,---,6io 

(i=  l,...,m;j  =  1,...,10) 

These  distributions  under  conjugate  priors  follow  from  standard  normal  and  gamma 
distribution  theory  and  are  as  follows: 

(0    oc\y,e,bi,---  ,bio,ri,---  ,rio,rQ 


.:ij:/Jal 


1 

1 

] 
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(iia)    For(j  =  l,---,9): 

bj\y,6,ot,bj.i,bj+i,r.j,ro 

~  7V[(51  r,  +  2  ro)-Mr,  E^i(%  -  a^Ja) 
+  ro(6,+i+6,_i)},  (51  r,- +  ro)-^] 

(iib)    bio\y,0,a,bg,rio,rQ 

~iV[(51rxo  +  ro)-MnoEfIi(^io-<ioa)  +  ^069}, 
-        (51rio  +  ro)-'] 

(m)    ri,---,rio|2/,^,6i,---,6io,ro 

'^^  Gamma{\Z-lyE]ii{0„-x„a-b,)\  25) 


^•^ 


(iu)    ro|y,0,6i,---,6io,ri,---,rio  ^  -  ^  | 

(v)    %|y,6i,---,6io,ri,---,rio,ro  1 

~iV[(t^^i  +  r,)-i(t;-'2/.,+r,(x^a  +  6,)),  ; 

i 

■  -   .       ■  -1 

Simulations  from  these  complete  conditional  distributions  can  be  made  using  the  \ 

standard  random  number  generators  from  the  normal  and  gamma  distributions.  We  \ 

\ 
"\ 

can  now  estimate  the  marginal  posterior  density  of  6ij  \y  as  described  earlier  in  Section  ,j 

j 

2.2.    We  find  estimates  of  the  four-person  families  for  each  year  from  1979-1989 

(except  for  1980)  using  all  the  sample  estimates  available  since  1979  census  by  running  5 

■] 

50  iterations  of  100-2000  replications  each.    The  necessary  convergence  is  achieved  \ 

1 

after  500  samples.  We  denote  these  HB  estimates  by  HB^  As  a  special  case  of  our  = 

model,  we  have  also  computed  the  median  income  estimates  for  1989  utilizing  only 
the  census  median  income  figures,  the  CPS  median  income  estimates  for  1989,  and 


.J 


21 

the  PCFs  for  the  years  1979  and  1989.  We  denote  these  HB  estimates  by  HB^.  The 
HB  estimates  are  not  based  on  any  time  series  model,  and  are  based  on  a  univariate 
version  of  the  HB  model  considered  in  Datta,  Fay  and  Ghosh  (1991).  Table  2.1 
provides  the  statewise  sample  median  incomes  as  well  as  the  HB^  and  HB^  estimates 
for  1989.  The  standard  errors  associated  with  these  estimates  are  given  in  Table  2.2. 
Also,  for  most  states,  the  time  domain  HB  analysis  provides  reduced  estimates  of 
standard  errors,  when  compared  to  the  corresponding  standard  errors  under  the  non 
time-series  model  of  Datta,  Fay  and  Ghosh  (1991).  The  time  domain  HB  procedure 
results  in  substantial  reduction  of  the  standard  errors  when  compared  to  the  sample 
estimates. 

The  median  income  estimates  along  with  their  standard  errors  for  the  year  1979 
are  provided  in  Table  2.3.  Since  in  addition  to  the  sample  estimates,  the  1979  incomes 
are  also  available  from  the  1980  census  data,  we  compare  the  retrospective  estimates 
obtained  for  1979  by  using  the  time  series  model  against  the  1980  census,  treating 
the  latter  as  the  "truth."  The  1980  Census  figures  are  labelled  as  Truth  in  Table-2.3. 

Suppose  e.TH  denotes  the  true  median  income  for  the  i"*  state  ,  and  e,  any  estimate 
of  Cij,^.  Then  for  the  estimate  e  =  (ei,  ■  •  ■ ,  esi)^  of  ctr  =  (ei^,^,  •  •  • ,  egi^^)^,  let 


Average  Relative  Bias  =    (51)-^  Ef=i  |ei  -  e,j,«|  /e,- 


TR 


Average  Squared  Relative  Bias    =    (51)-^  Ef=i  |e,-  -  e,^^f  je]^^ 
Average  Absolute  Bias  =    (51)-^  Ef=i  ki  -  e.^.^! 

Average  Squared  Deviation  =     (51)~^  Yll=i  i^i  -  ^irnf 

The  above  quantities  are  provided  in  Table  2.4  for  the  sample  estimates  and  HB^ 
estimates.  The  HB'  estimates  are  clearly  superior  to  the  sample  median  estimates 
for  each  state.  In  terms  of  the  average  relative  bias,  the  HB'  estimates  improve  on  the 
sample  medians  by  80.32  %.  Using  the  creterion  of  average  squared  relative  bias,  the 
corresponding  percentage  improvement  is  97.01.   Similarly,  for  average  absolute  bias 
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and  average  squared  deviations  as  the  creteria,  the  respective  percentage  improvement 
of  the  HB^  estimates  over  the  sample  medians  are  80.88  and  96.27.  Thus  time  series 
HB  procedure  improves  tremendously  on  the  sample  estimates. 
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Table  2.1.  Median  Income  for  4-Person  Family  by  State  in  1989 

(In  Dollars  ) 


State 

Sample 
Median 

HB^ 

HB2 

1 

37700 

37332 

37987 

2 

52720 

47829 

48232 

3 

41489 

39604 

-  •   40086 

4 

55729 

52661 

52817 

5 

47106 

.   43424 

43629 

'  6 

55048   ] 

I        53203 

53020 

7 

43434 

.   43855 

43879 

8 

53128 

53436 

53305 

9 

40990 

40745 

40762 

10 

44241 

42589 

42550 

11 

40297 

39736 

39604 

12 

42590 

43075 

42817 

13 

43260 

43101 

42907 

14 

X 

42226 

41111 

41006 

15 

45432 

43631 

43439 

16 

38711 

37773 

37713 

17    ' 

39232 

38932 

38976 

18 

36966 

34946 

35068 

19 

32610 

32116 

32387 

20 

40894 

38326 

38419 

21 

42781 

38820 

38821 

22 

39819 

41924 

41724 

23 

48252 

50346 

49893 

24 

36587 

40270 

40384 

25 

49718 

45511 

45656 

»■ 
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Table  2.1.  continued 


State 

Sample 
Median 

HBi 

HB2 

26 

31604 

32141 

32217 

27       ; 

]l  \      37423  :j  .'■, 

37399 

37564 

28 
29 

31834 
45876  ■ 

34407 
41050 

34626 

.  41479 

30 

37194 

37827 

37876 

31 

32186 

33546 

33779 

32 

31327 

33843 

34041 

33 

32245 

34079 

34272 

34 

31330 

31559 

31922 

35 

33317 

32480 

32918 

36 

36219   . 

35123 

35118 

37 

40358 

35680 

35865 

38 

32679 

34247 

34117 

39 

29586 

32598 

32602 

40   ^  . 

32227 

33030 

33161 

41 

38931 

36451 

35852 

42 

36732   .. 

39591 

39294 

43 

29761  - 

31238 

31462 

44 

38969 

38828 

38868 

45 

37938  ■ 

36913 

36942 

46 

34579 

39129  - 

38722 

47 

42392 

41821 

41559 

48 

;   41419 

39342 

39256 

49 

41121 

42010     •: 

41820 

50 

41379 

48607 

47146 

51 

44999 

44785 

44369 

25 


Table  2.2.  Estimated  Standard  Errors  of  Median  Income  Estimates 
for  4-Person  Family  by  State  in  1989 

(In  Dollars  ) 
State  Sample  HB^  HB^ 

Median 

2147 
2450 
2011 
1999 
2355 
2379 
1318 
1635 
1349 
1583 
1974 
1472 
1374 
1757 
,^,  *  2083 
1734 
2141 
1794 
1679 
1899 
2184  ■ 
2164 
,»  ^  '  -  :  2450 
'  2336 
2362 


1 

3230 

2010 

2 

4132 

2327 

3 

2848 

1936 

4 

2452 

1946 

5 

4512 

2289 

6 

3893 

2245 

7 

1512 

1302 

8  - 

1933 

1575 

9 

1584 

1335 

10 

1845 

1562 

11 

2970 

1931 

12    .; 

1763 

1440 

13 

1597 

1341 

14 

2310 

1722 

15 

3251 

2030 

16 

2258 

1698 

17 

3677 

2098 

18 

2249 

1757 

19 

2130 

1630 

20 

2502 

1869 

21 

3359 

2134 

22 

3699 

2117 

23 

;.,,   -1665   ^  . 

2314 

24 

4645   *r  .  ' 

2280 

25 

4349 

2292 

'  -.  ^'^.-. 

'w 

Table  2.2.  Continued 


26 


State       Sample       HB^  HB^ 

Median 

1736 
1311 
2020 
2377 
1242 
1968 
1870 
1913 
1804 
1836 
2099 
2311 
1394 
.  1918 
1727 
2356 
2088 
1962 
2346 
2050 
2084 
1842 
2040 
1369 
2665 
2369 


26 

2278 

1695 

27 

1513 

1289 

28 

2965 

1971 

m 

4385 

2295 

30 

1397 

1232 

31 

2885 

1915 

32 

2446 

1830 

m 

2690 

1869 

U 

2392 

1733 

m 

2435 

1762 

m 

3314 

2039 

m 

.  3923 

2244 

■m  ' 

1552 

1405 

39 

2557 

1896 

40 

2268 

1691 

41 

3516 

2158 

42 

3261 

2045 

m 

2811 

1902 

44^  • 

5003 

2293 

45 

3209 

2004 

46 

2863 

2059 

41 

2525 

1788 

48 

3056 

1993 

49 

1586 

1359 

50 

3637 

2299 

51 

4805 

2278 

27 


Table  2.3.  Median  Income  Estimates  for  4-Person  Family  by  State  in  1979 

(In  Dollars) 


State 

Truth 

Sample 

HB^ 

Median 

SE 

Median 

SE 

1 

18319 

18084 

1533 

17967 

516 

2 

22027 

23129 

1647 

21919 

495 

3 

19424 

18438 

1642 

19055 

508 

4    . 

23772 

25324 

1414 

23805 

501 

5 

22107 

23597 

2409 

21967 

511 

6 

•   25712 

25037 

1431 

25596 

509 

7 

22669 

21852 

766 

22329 

423 

8 

26014 

25486 

1264 

25913 

502 

9 

22266 

23326  . 

878 

22330 

46.0 

10 

23279 

22397 

843 

22963 

439 

11 

23014 

22716 

1041 

22826 

451 

12 

25410 

■  ■■  24294   ■  r 

1025 

25168 

486 

13 

25111   . 

.   24103    • 

1006 

24872 

477 

14 

23320   , 
1 

,   23270  . 

1202 

23179 

467 

15 

24044   ' 

■   25137  ^ 

1571 

24021 

501 

16 

22351 

23851  '   '■ 

1592 

22294 

499 

17 

21891 

.   20743 

1255 

21563 

478 

18 

20511 

18567 

1535 

20094 

506 

19 

18674 

19826 

1588 

18449 

522 

20 

21438 

22603 

1802 

21297 

501 

21 

22127 

22014 

1586 

21930 

487 

22 

23627 

21860 

1900 

23388 

510 

23 

26203 

26235 

1722 

26173 

528 

24 

21862 

20232 

3801 

21623 

520 

25 

22757 

24160 

1418 

22736 

494 

28 


Table  2.3.  continued 


State 

Truth 

Sample 

HBi 

~ 

Median 

SE 

Median 

SE 

26 

20214 

18274 

1380 

19758 

506 

27 

19772 

20296 

1012 

19611 

465 

28 

19944 

19282 

1795 

19624 

506 

29 

20668 

22687 

1196 

20703 

517 

30 

21086 

19675 

1042 

20650 

474 

31 

19685 

18657 

1285 

19295 

489 

32 

19693 

19776  ■ 

1274 

19434 

483 

33 

19926 

17978   . 

1282 

19436 

506 

34 

18150 

19167 

1762 

17875 

534 

35 

17893 

18917  ' 

1507 

17639 

530 

36 

21412 

18965  . 

1444 

20964 

513 

37 

20659  ■ 

19295 

1675 

20311 

501 

38 

22521  ;" 

22963 

873 

22466 

435 

39 

20776 

;   21216 

1625 

20571 

494 

40 

19961 

21533 

1543 

19827 

510 

41 

24641 

22013 

1559 

24327 

530 

42 

23757 

26746  ■ 

1677 

23853 

531 

43 

19257 

19968 

1597 

19017 

509 

44 

21924 

21733 

1828 

21712 

496 

45 

21572 

20727 

1440 

21285 

484 

46 

.-.  24438 

25447 

2051 

24388 

516 

47 

24394 

25463  ;  ■ 

1415 

24401 

498 

48 

22688 

24251  ,.,   ■ 

1500 

22665 

499 

49 

24752 

24265 

767 

24559 

427 

m 

31018  • 

30788  :  . 

1891 

31159 

655 

51 

24966 

23744 

2263 

24830 

526 

29 


Table  2.4.  Average  Relative  Bias,  Average  Squared  Relative  Bias,  Average  Abso- 
lute Bias,  and  Average  Squared  Deviations  of  the  Estimates. 


Sample  Median 


HBi 


Aver.  Rel.  Bias 


0.04984 


Aver.  Sq.  Rel.  Bias  0.00340 

Aver.  Abs.  Bias  1090.41       ^  * 

Aver.  Sq.  Dev.  1631203.47 


0.00980 

0.00014 

208.45 

60782.92 


CHAPTER  3 
MULTIVARIATE  ESTIMATION  OF  MEANS  IN  TIME  DOMAIN 

3.1     Introduction 

Periodic  sample  surveys  usually  provide  sample-based  estimates  for  several  pa- 
rameters. The  inter-relationship  among  the  parameters  can  play  an  important  role 
in  improving  the  small  area  estimators.  Fay  (1987)  demonstrated  that,  even  when 
one  parameter  of  a  vector  is  of  interest,  estimating  the  joint  vector  may  improve  the 
estimation  of  a  given  parameter  compared  to  restricting  attention  only  to  the  single 
parameter.  In  particular,  he  suggested  improving  on  estimation  of  median  incomes 
for  four-person  families  discussed  in  the  previous  chapters  by  including  information 
regarding  the  median  incomes  of  three-  and  five-  person  families  as  well  because  of  a 
strong  statistical  relationship  among  the  three  sets.  While  the  HB  model  of  Datta, 
Fay  and  Ghosh  (1991)  involved  bivariate  data,  their  analysis  was  restricted  to  a  fixed 
point  of  time. 

The  univariate  HB  model  considered  in  Chapter  2  can  be  extended  to  the  mul- 
tivariate problems  simply  by  taking  the  observations  at  each  time  as  vectors  rather 
than  scalars.  In  Section  3.2  we  present  a  general  framework  for  multivariate  HB  anal- 
ysis when  the  error  variance-covariance  matrices  are  known  for  all  time.  This  section 
also  provides  a  methodology  for  obtaining  marginal  posterior  densities  through  Gibbs 
sampling  algorithm.  We  revisit  the  problem  of  median  income  estimation  in  Section 
3.3.   We  find  the  median  income  estimates  based  on  the  model  given  in  Section  3.2 
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for  the  period  1979-1989  for  all  the  states.  We  have  also  compared  the  estimates  for 
the  year  1989  with  the  corresponding  estimates  obtained  in  Section  2.4 

3^2 The  General  Multivariate  Hierarchical  Saves  Model 

Suppose,  based  on  a  given  sample  at  timej,  (;  =  1,  •  •  • ,  f )  that  F.j  =  {Yiji,  •  •  • ,  y;_„)^ 
is  a  s-dimensional  column  vector  of  sample  survey  estimators  of  some  characteristics 
^tj  =  (Oiji,  •  ■  • ,  ^.j,)^,  for  the  i^''  small  area  {i  =  1,  •  •  • ,  m).  As  in  Chapter  2,  our  ob- 
jective is  to  optimally  estimate  a  linear  function  YlT=i  T,]=i  ^J^.j,  using  all  the  data 
available  upto  time  t.  Consider  the  following  HB  model: 

I.  y.,|5,,  '^'  N{e,,,  V„)  (z  =  1, . . . ,  m;;  =  1, . . . ,  0; 

II.  0ij\oc,  bj,  rP^  '^^  N{Xi,<x  +  Zi,h,,  V',)  (^-  =  1,  •  •  • ,  m;  j  =  1, . . . ,  0; 

III.  b,\b,.,,W'^' N{H,h,_,,  W){j  =  l,...,t)- 

IV.  Marginally  a,  Vi,  •  •  • ,  V**,  and  VT  are  mutually  independent  with 
or  ~  Uniform  (/?''), 

V'j  ~   inverse  Wishart  {Sj,kj),   - '■  . 
W  ~   inverse  Wishart  (5o,  ko). 

In  (II),  Xij  {s  X  p)  and  Z,j  {s  x  p)  are  known  design  matrices. 

In  (IV),  we  shall  allow  the  possibility  of  diffuse  priors  for  7p^  or  W,  e.g.     Sj   = 

O,for(i  =  l,...,i),  So  =  0. 

The  hierarchical  structure  of  the  model  implies  that 


i-4 


.■"* 
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(0       aiy,^,6i,---,6t,'0i,---,V't,'W^~a|^,6i,---,6t,'0i,---,'0t 

(i,i/  =  l,...,f-l) 

(m)     V'j|y,^,Q:,6i,---,6t,Vr~-0^.|a,e,6i,---,6tO-  =  1,...,/) 
M    W\y,e,bi,--,bu'>P,,-,il^t--W\br,--,bt  ;■ 

{i  =  l,...,m;j  =  l,...,f) 

Formal  derivation  of  the  complete  conditional  distributions  is  now  straightforward 
and  is  omitted.  The  complete  conditional  distributions  are  given  by 

~  N,  [(Er=i  zu  ^JV'-^x,)-^  E-i  EU  xfi^J'  (^u  -  zlb,) , 

{iia)    For  (j  =  1,  •••,<-  1)  :       5  '  A 


iV„ 


33 


(iib)    bt\y, e,cx,bi,--,bt,il}i,--,'tl;t,W 


N„ 


-1 


{Hi)  Vi,---,V'tly,^,6i,---,6t,^ 

'~    Inverse  Wishart  [Sj  +  EI^i  i^ij  -  X.jOt  -  Zi^bj) 
X  {0ij  -  XijCt  -  Zijbjf  ,  kj  +  m] 

~  Inverse  Wishart  (Sq  +  E '=1  (6j  -  fl"j6;-i)  (6j  -  H,b,_if ,  ito  +  /) 

H  o,j\y,bu--,bt,ii,„...,i,,,w  :- 


iV 


A  Rao-Blackwellized  sample  based  density  estimate  of  dij  given  y  based  on  sim- 
ulated samples  from  the  above  conditional  distributions  can  be  approximated  by 

3.3     An  Illustrative  Application  to  Survev  data 

In  this  section,  we  will  illustrate  the  implementation  of  multivariate  HB  model 
described  in  Section  3.2  .    We  again  consider  the  problem  of  estimation  of  median 


income  for  four-person  families  by  state.  Fay  (1987)  suggested  improving  on  estima- 
tion of  median  incomes  for  four-person  families  by  including  information  regarding 
the  median  incomes  of  three-  and  five-  person  families  as  well  because  of  a  strong 
statistical  relationship  among  the  three  sets.  He  advocated  the  use  of  a  multivariate 
empirical  Bayes  (EB)  or  variance-components  approach  for  the  composite  estimation 
of  median  incomes  of  four-person  families  by  state.    Datta,  Fay  and  Ghosh  (1991) 
adopted  a  bivariate  hierarchical  linear  model  for  estimating  the  statewide  median 
incomes  of  four-person  families.  In  their  procedure,  one  component  was  the  median 
income  of  four-person  families,  while  the  other  component  was  a  convex  combination 
of  the  median  incomes  of  five-  and  three-person  families.  The  analysis  required  eval- 
uation of  3-dimensional  integrals.   Instead,  we  will  use  Monte-carlo  integration  via 
Gibbs  sampling  algorithm.  In  our  analysis,  we  will  be  considering  as  the  basic  data 
51  three  component  vectors  corresponding  to  the  median  incomes  of  four-,  three-, 
and  five  person  families  for  the  years  1979-1989  (except  for  1980).    Our  target  is 
to  estimate  statewide  median  incomes  of  four-person  families,  although  our  proce- 
dure furnishes  estimates  of  median  incomes  of  three-  and  five  person  families  as  well. 
Note  that  direct  numerical  integration  in  a  trivariate  set  up  will  require  evaluation 
of  six-dimensional  integrals,  which  is  fairly  complicated  and  often  unreliable.    The 
Gibbs  sampling  technique  avoids  such  problems. Our  analysis  rests  heavily  on  the 
implementation  of  the  Gibbs  sampling  algorithm. 

We  consider  as  the  basic  data  the  three  component  vector  Y ,,  =  (K^i,  >02,  V.^sf , 
z  =  1,  •  •  • ,  51;  j  =  1,  •  •  • ,  10,  where  yjji,  Y',j2,  and  Yij^  are  the  sample  median  incomes 
of  four-,  three-,  and  five-person  families  in  state  i  in  year  j.  The  true  medians  cor- 
responding to  Kj's  are  denoted  by  ^./s  respectively.  Let  Oij  =  {Oiji,  %2,  Oijsf-  We 
shall  also  write  as  y  =  (y„,  -  -  • ,  ^51.10)^  ,  0  =  (^n,  •  •  • ,  ^51,10)^. 

We  would  like  to  introduce  few  more  notations  before  describing  the  HB  model. 
Let  Xijii  and  x,ji2  denote  respectively  the  adjusted  census  median  income  for  the 
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year  ;  and  the  base-year  census  median  income  for  four-person  families  in  the  i"' 
state  (i  =  1,  •  •  • ,  51).  Also,  let  a;,j2i  and  x,j3i  denote  respectively  the  adjusted  census 
median  incomes  for  the  year  j  for  three-,  and  five-person  families.  The  corresponding 
base-year  census  median  income  for  three-,  and  five-person  families  are  denoted  by 
Xij22  and  Xij32-  Finally,  let,  .  ..v  ,  • 

■■.■"'  -  -  ',  i  .    ,V  ■    .<  '■  . 

(  1    Xijn    Xiji2    0       0  0       0       0  0     \     ■ 

X,j=\    0       0         0       1    x.,21    :r,i22    0       0         0         ,  (3.3.1) 

.       ;  V  0       0  0       0       0  0       1    Xi,3i    a:.,32  / 

i  =  1,  •  •  • ,  51,  j  =  1,  •  •  • ,  10,  and  denote  by  a  =  (ai,  •  ■  • ,  org)  the  vector  of  regres- 
sion coefficients.  Also,  bj  =  {bji  bj2  bjsf  ,  (j  =  1,  •  •  • ,  10)  is  the  vector  of  random 
componets  for  the  year  j. 

The  following  HB  model  is  used: 

L        Y,,\e,,'^'Nie,„  r.,)(i  =  l,...,51;i  =  l,...,10); 

II.  ^.■,|«,6„V'/~  yV(X.,a  +  6„  ^/',)(^■  =  l,•■•,51;7  =  l,•••,10); 

III.  b,\bj.^,W'^'N{b,.i,W){j^l,...,10); 

IV.  Marginally  a,  V'l,  •  •  • ,  ipf,  and  W  are  mutually  independent  with 
a  ~  Uniform  (R^),  , 

■0J  ~   inverse  Wishart  {Sj,kj), 
W~  inverse  Wishart  {So,  k'o). 

In  (II),  Xij  (3  X  p)  is  a  known  design  matrix. 

In  (IV)  we  assume  diffuse  priors  for  V'j  or  W,  e.g.  Sj  =  0,  for  {j  —  1,-  ■  ■ ,  10),  So  = 

0. 

As  mentioned  earlier,  closed-form  expressions  for  E(0ij\y)  and  V{Oij\y)  in  such 
a  set-up  can  not  be  obtained.   Also,  it  is  too  complicated  to  estimate  E{6ij\y)  and 
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y{^ij\y)  through  numerical  integration.  We  will  be  using  the  Gibbs  sampling  proce- 
dure to  simulate  samples  from  the  following  complete  conditional  distributions: 

(0    a|y,^,6i,---,6io,Vi,---,'0io,'^ 


A^„ 


(e^x  E]°i  ^;V'7^^u)"  E^i  Z]U  xl^pf  {9,,  -  h,) 


(e^ie;°xx;v-^x,) 


-1 


{iia)    For  {j  =  l,---,9)  : 

bj\y,  e,a,bi,-   ■,  bj-i,  6j+i ,  •  •  • ,  6io,  V'l,  •  •  • ,  V'io>  ^ 


~  ATo 


-1 


(51 1/)-^ +2w-^y 

{iib)    6io|y,^,a,6i,---,6io,^i,---,'0io>^ 

~iV3[(5i  vro +■^^"0"' 

{in)    ipu---,fpio\y,9,bi,---,bio,W  , 

•~    Inverse  Wishart  [Sj  +  Z]li  {^ij  -  X„ct  -  b,) 

.   ., ,.    _  x{ei,-Xii<x-b^f ,  bi 

{iv)    Ty|y,^,6i,---.6io,V'i,---,V'io     "  '      '^ 

~  Inverse  Wishart  (e]=i  (6j  -  6j_i )  {bj  -  6,_i  )^  ,  lO) 


'jji    %i  V'"  »■     ' 
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(v)    ^ij\y,bi,---,bio,ipi,---,ipio,W 

~  iV  [(V-.I  +  V-')-^  (v-.^y.,  +  ^-^  (X,,a  +  6,))  , 

With  all  the  complete  conditional  distributions  available  in  closed  form,  we  can 
now  estimate  the  marginal  posterior  density  of  Oij\y  as  described  earlier  in  Section  4.2. 
We  find  estimates  of  the  four-person  families  for  each  year  from  1979-1989  (except 
for  1980)  using  all  the  sample  estimates  available  for  three-,  four-,  and  five  person 
families  since  1979  census  (except  for  1980)  by  running  50  iterations  of  100-2000 
replications  each.  The  necessary  convergence  is  achieved  after  500  iterations.  We 
denote  these  HB  estimates  by  HB^.  As  in  Section  2.4,  we  also  consider  the  non  time- 
series  analysis  which  utilizes  only  the  census  median  income  figures  for  1979  and  the 
CPS  median  income  estimates  for  1989,  and  the  PCI's  for  the  years  1979  and  1989. 
In  this  case,  we  utilize  the  data  available  for  three-,  four-,  and  five  person  families  in 
contrast  to  using  only  the  information  for  four-person  families  as  in  Section  2.4.  We 
denote  these  estimates  by  HB''.  These  estimates  are  based  on  a  trivariate  version  of 
the  HB  model  considered  in  Datta,  Fay,  and  Ghosh  (1991).  Table  3.1  provides  the 
sample  median  incomes  as  well  as  the  HB^  and  HB'*  estimates.  The  HB^  estimates 
are  based  on  the  time  series  HB  described  earlier  in  this  section.  The  standard  errors 
associated  with  these  estimates  are  given  in  Table  3.2.  Again  in  conformity  with  the 
results  in  the  case  of  univariate  analysis  in  Chapter  2,  the  time  domain  HB  procedure 
results  in  substantial  reduction  of  the  standard  errors  when  compared  to  the  sample 
estimates.  Also,  the  standard  errors  associated  with  the  HB^  estimates  are  lower  than 
the  standard  errors  of  the  HB^  estimates  for  each  state. 

The  median  income  estimates  along  with  their  their  standard  errors  for  the  year 
1989  are  provided  in  Table  3.3.  Also,  as  in  Section  2.4,  we  compare  the  retrospective 
estimates  obtained  for  1979  by  using  the  time  series  model  against  the  1980  census. 
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treating  the  latter  as  the  "truth" .  We  also  compute  the  creteria  average  relative  bias, 
average  squared  relative  bias,  average  absolute  bias,  and  average  squared  deviation  (as 
defined  in  Section  2.4)  for  the  HB^  estimates.  These  quantities  are  provided  in  Table 
3.4.  Again,  the  HB^  estimates  are  clearly  superior  to  the  sample  median  estimates  for 
each  state.  In  terms  of  the  average  relative  bias  ,  the  HB^  estimates  improve  on  the 
sample  medians  by  87.62  %.  Using  the  criterion  of  average  squared  relative  bias,  the 
corresponding  percentage  improvement  is  98.24.  Similarly,  for  average  absolute  bias 
and  average  squared  deviations  as  the  creteria,  the  respective  percentage  improvement 
of  the  HB^  estimates  over  the  sample  medians  are  87.69  and  98.28.  Thus  time  series 
HB  procedure  improve  tremendously  on  the  sample  estimates. 

We  also  compare  the  HB^  estimates  for  the  year  1979  based  on  the  trivariate  time 
series  HB  model  with  the  HB^  estimates  based  on  univariate  analysis  in  Chapter  2. 
In  terms  of  the  average  relative  bias,  the  HB^  estimates  improve  on  the  HB^  estimates 
by  37.04  %.  Using  the  criterion  of  average  squared  relative  bias,  the  corresponding 
percentage  improvement  is  57.14.  Similarly,  for  average  absolute  bias  and  average 
squared  deviations  as  the  criteria,  the  respective  percentage  improvement  of  the  HB^ 
estimates  over  the  HB^  estimates  are  33.67  and  53.94.  Thus,  the  trivariate  time  series 
HB  model  captures  the  inter-relationships  between  the  median  income  of  three-, 
four-  and  five-person  families  to  improve  the  median  income  estimates  of  four-person 
families.  '""  ... 


""af^^W" 
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Table  3.1.  Median  Income  for  4-Person  Family  by  State  in  1989 
■  ''       ■     (In  Dollars  ) 


State 

Sample 
Median 

HB^ 

HB'' 

1 

37700 

37826 

38172 

■:,%- 

52720 

49130 

48988 

3 

41489 

1  • 

40889 

41118 

4   ,..,, 

-   55729 

53372 

52986 

5 

47106 

43723 

43578 

6 

55048 

54755 

54140 

7 

43434 

43760 

43513 

8 

53128 

53756 

53076 

S 

40990 

40582 

40508 

10 

44241 

42408 

42326 

11 

40297 

38250 

38121 

12 

42590 

42732 

42360 

13 

43260 

43405 

43090 

14 

42226 

41033 

40932 

15 

45432 

,  43043 

42748 

16 

38711 

.   37100 

37122 

17 

39232 

38512 

38491 

18 

36966 

35498 

35790 

19 

32610 

33137 

33514 

20 

40894   . 

38846 

38988 

21 

42781 

38934 

38990 

22 

39819 

43068 

42751 

23 

48252 

51029  , 

50229 

24 

36587 

39918 

39842 

25 

49718 

46250 

46019 

40 


Table  3.1. 

continued 

State 

Sample 
Median 

HB3 

HB'' 

26 

.     ■    31604 

;   31102 

31346 

27 

37423 

38078 

38174 

28  I,; 

./      31834   '  r 

35686  4 

35865 

29 

45876 

40665 

40830 

30 

37194 

~  '  •   36952 

36919 

31 

32186 

33722 

33976 

32 

31327 

33911 

34048 

33 

32245 

34192 

34365 

34 

31330 

31902 

32284 

35 

33317 

31516 

32021 

36 

36219 

34409 

34543 

37 

40358 

34252 

34551 

38 

32679 

34226 

34137 

39 

29586 

32865 

32944 

40 

32227 

33112 

33319 

41 

38931 

37177 

36842 

42 

36732 

40034 

39727 

43 

29761 

30098 

30441 

44 

38969 

38132 

38100 

45 

>   .    37938 

36890 

36953 

46 

34579 

38654 

38221 

47 

42392 

42561 

42316 

48 

41419 

39475 

39404 

49 

41121 

42787 

42421 

50 

41379 

-  49151 

47671 

51 

44999 

45806 

45262 

41 


Table  3.2.  Estimated  Standard  Errors  of  Median  Income  Estimates 
for  4-Person  Family  by  State  in  1989 

(In  Dollars  ) 


State 

Sample 

HB^ 

HB'' 

Median 

1 

3230 

1867 

2051 

2    ■•  . 

4132 

2129 

2281 

3 

2848 

1825 

1925 

•.-"* 

2452 

1710 

1848 

,$■     ■ 

4512 

2065 

2168 

6 

3893 

2291 

2399 

7    ,    :. 

1512 

1167 

1231 

8 

1933 

1389 

1545 

9 

1584 

1157 

1194 

10 

1845 

1350 

1399 

u 

2970 

1774 

1830 

■  If   ■■   \  ■■;- 

1763 

1272 

1342 

^  1$ 

1597 

1225 

1277 

14 

2310 

•   1581 

1630 

15 

3251 

1865 

1925 

16 

2258 

1501 

1560 

Xt 

3677 

1875 

1928 

18 

2249 

1537 

1615 

19 

2130 

1494 

1586 

20 

2502 

1687 

1737 

21 

3359 

1952 

2016 

22 

3699 

1972 

2024 

23 

24    -  •  ■  ■ 

4665 
4645  '  I 

2214 
2235 

2355 
2295 

25     '^   ■ 

4349 

2017 

2104 

■'.-■  „  ■-.;  ..  /-^ 


■"»" 
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Table  3.2.  continued 


State  Sample  HB^  HB'' 
Median 

26  2278  1515  1594 

27  1513     •  •■  1140       ~  1194 

28  2965  1766   ^  ...  1842 

29  .  ,  4385  ■  "^  2049  2148 

30  1397  1113  1154 

31  2885  1774  1839 

32  2446  1702  1761 

33  2690  1706  1768 

34  2392  1614  1701 

35  2435  1587  1704 

36  3314  1841  1931 

37  .  3923       ^   2031  2135 

38  '  '  1552  1287  1292 

39  •  ;    2557  .  1693  1731 

40  ;-.  -^  2268  ,  .1  '  1516  1573 

41  ..•■;.  3516  ■,'  .  2030  2280 

42  3261  1897  1944 

"'.■">    • 

43  2811  •■  '  ■  .^  .  1661  1773 

44  5003  2022  2089 

45  3209  1745  1809 

46  2863  1863  1914 

47  2525  1679      /%^  1749 

48  3056  1821  1881 

49  1586  1216  1235 

50  3637       -   2260  2607 

51  4805  2144  2252 
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Table  3.3.  Median  Income  Estimates  for  4-Person  F  amily  by  State  in  1979 

(In  Dollars  ) 


State 

Truth 

Sample 

HB^ 

Median 

SE 

Median 

SE 

1 

18319 

18084 

1533 

18206 

511 

2 

22027 

23129 

1647 

22032 

481 

V   ' 

19424 

18438 

1642 

19246 

498 

4 

23772 

25324 

1414 

23808 

467 

5 

22107 

23597 

2409 

22094 

499 

6 

25712 

25037 

1431 

25631 

493 

7 

22669 

21852 

766 

22414 

395 

8 

26014 

25486 

1264 

25938 

482 

9 

22266  • 

23326 

878 

22440 

453 

10 

23279 

22397 

843 

23061 

412 

11 

23014 

22716 

1041 

22819 

449 

12 

25410 

24294 

1025 

25167 

459 

13 

25111 

24103   . 

1006 

24963 

454 

14 

23320 

23270 

1202 

23322 

471 

15 

24044 

25137 

1571 

23928 

526 

16 

22351 

23851 

1592 

22375 

477 

17 

21891 

20743 

1255 

21695 

456 

18 

20511 

18567 

1535 

20211 

491 

19 

18674 

19826 

1588 

18621 

506 

20 

21438 

22603 

1802 

21395 

482 

21 

22127 

22014 

1586 

22025 

466 

22 

23627 

21860 

1900 

23476 

493 

23 

26203 

26235  '■ 

1722 

26202 

517 

24 

21862 

20232 

3801 

21737 

507 

25 

22757 

24160 

1418 

22866 

486 

44 


Table  3.3.  continued 


State 

Truth 

Sample 

HB^ 

'■  -^' 

Median 

SE 

Median 

SE 

26 

20214 

18274 

1380 

19890 

496 

27 

19772 

20296 

1012 

19702 

445 

28 

19944 

19282 

1795 

19757 

502 

29 

20668 

22687  .  ^ 

1196 

20824 

489 

30 

21086 

19675 

1042 

20717 

464 

31 

19685 

18657   . , 

1285 

19539 

484 

32 

19693 

19776   '  ■* 

1274 

19532 

471 

33 

19926 

17978 

1282 

19541 

510 

34 

18150 

19167 

1762 

18106 

517 

35 

17893  . 

18917 

1507 

17844 

517 

36 

21412 

18965 

1444 

21084 

491 

37 

20659 

19295 

1675 

20426 

489 

38 

22521 

22963 

873 

22501 

416 

39 

20776 

21216 

1625 

20646 

482 

40 

19961 

21533 

1543 

20008 

500 

41 

24641 

22013 

1559 

24412 

505 

42 

23757 

26746 

1677 

23944 

518 

43 

■   19257  ■ 

19968 

1597 

19167 

496 

44 

21924 

21733 

1828 

21861 

489 

45 

21572  -. 

20727  ' 

1440 

21366 

469 

46 

24438 

25447 

2051 

24416 

498 

47 

24394 

25463 

1415 

24399 

477 

48 

22688 

24251 

1500 

22720 

481 

49 

24752 

24265 

767 

24518 

418 

50 

31018 

30788 

1891 

30937 

660 

51 

24966 

23744 

2263 

24873 

508 

45 


Table  3.4.  Average  Relative  Bias,  Average  Squared  Relative  Bias,  Average  Abso- 
lute Bias,  and  Average  Squared  Deviations  of  the  Estimates. 


Sample  Median 


Aver.  Rel.  Bias  '  0.04984 

Aver.  Sq.  Rel.  Bias  0.00340 


Aver.  Abs.  Bias 
Aver.  Sq.  Dev. 


1090.41 


1631203.47 


'    t    r.  ^ 


HB^ 


A. 


0.00617 

0.00006 

134.27 

27996.90 


•      CHAPTER  4 
HIERARCHICAL  BAYES  PREDICTION  IN  FINITE  POPULATIONS 

?  '-,  4.1     Introduction 

■      * 

In  Chapter  2  and  Chapter  3,  we  introduced  HB  procedures  for  small  domain 
estimation  based  on  data  from  repeated  surveys  from  an  infinite  population.  In  such 
models,  we  assumed  that  the  observations  are  available  only  at  the  aggregate  level. 
There  we  derived  the  HB  estimators  for  the  current  as  well  as  previous  time  points, 
simultaneously  for  all  the  small  area  population  means. 

In  this  Chapter,  we  consider  the  problem  of  small  domain  estimation  in  the  context 
of  finite  population  sampling.  In  the  finite  population  setup,  we  assume  that  there  are 
m  strata,  the  i*''  stratum  at  time  j  contains  a  finite  number  of  units  Ni^,  and  the  unit 
level  information  on  the  characteristic  of  interest  at  time  j,  {j  =  I.- ■  ■  J)  is  available 
through  a  sequence  of  random  sample  surveys  of  the  given  strata  populations.  We 
are  interested  in  predicting  certain  parameters  (such  as  finite  population  mean  or 
total  or  change  in  mean  over  two  distinct  time  points  for  each  stratum)  of  the  strata 
populations  which  are  changing  over  time. 

We  describe  a  general  HB  approach  in  Section  4.2.  There  we  consider  the  most 
general  situation  where  all  the  variance  components  are  unknown.  Some  prior  dis- 
tributions are  assigned  to  the  vector  of  regression  coefficients  and  the  variance  com- 
ponents.  This  section  provides  the  predictive  distribution  of  unobserved  population 
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units  at  all  the  time  points  given  the  time  series  data  on  the  sampled  units.  The  com- 
putation of  the  estimation  of  population  means  simultaneously  for  several  small  areas 
at  all  the  time  points  through  the  implementation  of  the  Gibbs  sampling  algorithm 
(as  described  in  Section  2.2.1)  have  also  been  discussed  in  this  section. 

In  Section  4.3,  we  consider  a  special  case  of  the  general  HB  model  in  Section 
4.2.    We  assume  that  A,  the  vector  of  ratios  of  variance  components  is  known.    In 
such  a  case,  it  is  possible  to  find  the  posterior  mean  vector  and  posterior  variance- 
covariance  matrix  corresponding  to  the  characteristic  vector  of  non-sampled  units 
at  all  the  time  points,  in  a  closed  form.   In  subsequent  sections  4.4-4.5,  we  assume 
this  special  case  of  the  HB  model.    The  corresponding  HB  predictor  as  obtained 
in  Section  4.3  is  shown  to  possess  some  interesting  optimal  properties.    Datta  and 
Ghosh  (1991)  also  assumed  known  ratios  of  the  variance  components  in  a  special  case 
of  their  general  HB  approach  for  small  area  estimation.  The  HB  predictors  in  their 
special  case  have  been  shown  to  be  the  best  with  in  the  class  of  all  linear  unbiased 
predictors.    For  a  class  of  spherically  symmetric  distributions,  their  HB  predictors 
are  also  optimal  within  the  class  of  all  unbiased  predictors.    In  Datta  and  Ghosh 
(1990)  these  HB  predictors  dominate  stochastically  the  hnear  unbiased  predictors  in 
the  sense  of  Hwang  (1985).  Furthermore,  under  a  suitable  group  of  transformations, 
Datta  and  Ghosh  (1990-91)  showed  these  predictors  to  be  the  "best"  within  the  class 
of  all  equivariant  predictors  for  elliptically  symmetric  distributions.  In  the  set  up  of 
Datta  and  Ghosh,  the  HB  predictor  borrowed  strength  from  other  small  areas.  In  our 
set-up,  as  we  have  mentioned  earlier  in  Chapter  2,  the  HB  predictor  borrows  strength 
over  time  as  well  in  addition  to  borrowing  strength  from  other  small  areas.  Our  HB 
predictors  as  derived  in  Section  4.3  based  on  a  time  series  HB  model,  possesses  all 
the  optimal  properties  listed  above.  In  establishing  this  in  Sections  4.4-4.6,  we  have 
followed  the  approach  as  in  Datta  and  Ghosh  (1990-91).   In  Section  4.4,  it  is  shown 
that  the  HB  predictors  of  section  4.3  are  best  unbiased  predictors  within  the  class  of 
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all  unbiased  predictors.  Besided,  it  is  shown  that  these  predictors  are  BLUPs.  Section 
4.5  establishes  the  stochastic  dominance  (cf.  Hwang,  1985)  of  these  predictors  over 
the  LUPs  for  elliptically  symmetric  distributions.  Finally,  in  Section  4.6,  we  have 
shown  that  these  predictors  are  also  best  equivariant  predictors  for  both  the  matrix 
loss  (or  standardized  quadratic  loss)  under  suitable  group  of  transformations  for  a 
class  of  elliptically  symmetric  distributions. 


'      '*•    f,  w  »    -^  » 


4.2     Description  of  the  Hierarchical  Saves  Model 

In  this  section,  we  consider  a  hierarchical  model  for  prediction  of  the  characteristic 
of  interest  (like  the  population  total  or  mean  for  each  small  area  or  domain)  in  the 
context  of  finite  population  sampling  .  Let  Yijk  denote  some  characteristic  of  interest 
associated  with  the  ^•"'  unit  at  time  j  in  the  i*''  small  area  (k  =  l,---,Nij;j  = 
I,-  ■  •  ,t;i  =  1,  •  •  • ,  m).  For  notational  simplicity,  we  assume  that  the  characteristic 
of  interest  is  a  scaler.  The  results  can  be  easily  extended  to  the  case  when  Yijk  is  a 
vector.  Let  :• 

y^^>   -   (Y .....  Y-.    )T 

The  superscripts  (1)  and  (2)  are  used  to  denote  the  sampled  and  non-sampled  units 
in  the  i"*  small  area  at  time;,  respectively.  Nij  is  the  population  size  of  the  i*''  small 
area  at  time  j  from  which  a  sample  of  size  n,j  is  taken  {I  <  i  <  m;l  <  j  <  t). 
Xij    {riij  X  p)  and  Z-j    (n,j  x  q)  known  design  matrices  correspond  to  the  sampled 

units.  Xjj^^  {{Nij  -  Uij)  X  p)  and  Z\f  {{Nij  -  rnj)  x  q)  correspond  to  the  unsampled 
units.  ' 

We  consider  the  following  Bayesian  model: 
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I     I       'J 


^|!),-^AV-yv((^|),.^l^,(A)); 


(z  =  I,---,  m;  J  =  1,  •••,<) 

2     \   ind 


(  e^'^ 


-(1) 


r(i) 


a,6„cr^  A  '^"  iV  |^(^  ^^  j  a  +  (^  p,  j  6i,<T^V',(A)J  ; 


{i  =  l,---,m;j  =  l,---,f) 

III.    6,|6,_^,W,A'^^V(HA-l,  <72W(A));(j  =  l,...,0 

^  We  assume  that  V^,j(A),  rj}-{X),{i  =  l,...,m;;  =  l,---,^,  and  W(A)  are  struc- 

turally known  except  possibly  for  some  unknown  A  -  (Aj,  •  •  • ,  A,).  Hj{q  x  q),  {j  = 
I,-  ■  •  ,t)  are  known  matrices.  6o  is  assumed  to  be  known.  Without  any  loss  of  gener- 
ality, we  assume  6o  =  0.  .    .  ,  . 

Before  writing  the  stages  I-III  of  the  above  model  in  a  compact  form,  we  need  to 
introduce  a  few  notations.    We  denote  a,  u  x  u  identity  matrix  by  /„.    *  denotes  a 
null  matrix  of  suitable  order. 
We  define  •. 


cp) 


—       \^lt      i^2t       ^"'^^mt     j 


Y\l^     ).;   (z  =  l,---,m) 


—       I'-'lt       i»-'2«       '■■••*-^mt       I 
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Ai       -     {^-X-ii     ,A,2     ,---,A,V   j     ;(«  =  !,•••, m) 
^i      -    y-^ii    »^i2    ^■■■,-^it    J    ;(2  =  l,---,m) 


=   (x<"^jci"^...,x»'^)'- 


*  r      -  -v. 


U,i      =    nLi?*,  /  =  l,2,---,i;i  =  l,---,f 
=     /,  for/ =  ;-|-l,j  =  !,...,( 


Bi;>  = 


(1) 


nr(l)TT  7(1) 


rWTT  -7(1)1 


(1) 


B\t 


(2) 


yWrj  7(2) 

■^.2   *^22       -^j^ 


;7(2)rr  7(2)|-r 


4» 


7(2) 


B['^  = 


B\\ 


(1)  1 


,(1) 

'2t 


r(1) 


B\ 


(2)  _ 


o(2) 
■"It 
d(2) 
^2f 


B 


(2) 
mt 


V^=   BlockDiag  {Vn,---.V,uV-,u---.V.it,.--,Vrr.u---,Vmi) 
.     xl^i^   Block  Diag  [xjj^,-- -,11)  ^)  .. 

We  now  write  the  model  as  a  compact  linear  mixed  model  which  incorporates  all  the 
information  up  to  and  including  time  t  for  all  the  small  areas.  We  have,  without  any 
loss  of  generality,  listed  the  sampled  units  first. 


m-'' 
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.(1) 


C\ 


(2) 


A\ 


(2)     I  «  +    I      J2)      I  ^  + 


V 


(1) 

,(2) 


where 


w    ~iV(o,a2(/t0Ty(A))) 


»7 


.(2) 


N  {0,a^{{Im®AW)  +  Vt{X))) 


Writing 


Ct  = 


,  At  = 


A\ 
A\ 


(1) 


(2) 


,  Bt^ 


B\ 
B\ 


(1) 


(2) 


,  R  =  {cr') 


given  a,A,cr^,  and   r 


Ct  ~  iV  (^e  a,  r-'  Et) 


2\-l 


(4.2.1) 


(4.2.2) 


where 


Et  =  Et{\)  =  Bt  (It  QW)Bj  +  {Im  0  ^t)  +  Vt 


Furthermore,  we  partition  27,  as 


I^t  = 


^t2\       Et22 


where  i7ni  isn^xn,,  I7n2   (=  i^nj)  is  n<  x  (iV<-nt),  and  i:ni  is  (A^t-n,)  x  (A^t-n^). 
We  also  define 


1  ...••  1 


"    ii 


« -  ■    * 


S»  *' 


'.-*  *^  » 


■>■     '.-*;' 
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Our  objective  is  to  use  all  the  data  available  at  a  given  time  in  predicting 

where  Pj  and  P2  are  known  matrices.  In  particular,  we  may  be  interested  in  pre- 
dicting retrospectively  the  population  totals  for  the  small  areas  at  the  previous  time 
points  based  on  all  the  available  sampled  data  at  a  given  time.  To  this  end,  we  derive 
the  predictive  distribution  of  C|^'  given  C^^  =  c|^^  in  the  following  theorem.  A  proof 
of  this  theorem  is  given  in  Appendix. 

Theorem  A. 2.1  Consider  the  model  in  4.2.1  or  4.2.2.  Assume  that  a,  i?,  Ai /?,••• ,  \,R 
are  independently  distributed  with 

a       ~    Uniform  (RP)       p    "  '"      jjjv 


R       ~    Gamma  {^ao,^ho);ao>0, ho  >0 
XiR    ~     Gamma  (|a;,|/i/);/ =  I,--- ,5 


withai  >  0, /i/  >  0,  (/=  l,---,5).  Assume  that  rit  +  EUo  ^i  - P  >  2-  Then  conditional 
on  A  —  \,  and  Cf  =  Cf  ,  C^  is  distributed  as  multivariate  t  with  degrees  of 
freedom  rit  +  J2'i=o  hi  —  p,  location  parameter  MtCf     and  scale  parameter 


n,  =  ^n,  +  p  h,  -  p^       L  +  X:  atXt  +  c['^''k,cIA  G, 


(4.2.3) 
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Also  the  conditional  distribution  of  A  given  C<     =  Cf     has  the  pdf 


/(Aicf))  oc  ii;mr' A/i'^i^riUr^ 


.;   :-:>         x  Uo  +  ;^aA/ + +cr^V,cl'>) 


-iK+E;=o'"-p) 


(4.2.4) 


/=i    i 


Now  the  conditional  expectation  and  variance  of  Cp'  given  CJ^'  =  c[^^  can  be  derived 

as       ,  ■ 

-■■■  ':^     ;         ;:-;..   ,■ 


£(cp>|cr^)      =     £(M.|'cP)^r)"^' 


(4.2.5) 


(4.2.6) 


X  E 


ao  +  E«'A,  +  cp'   iiT.cl^'JGncp^ 


From  (4.2.5)  and  (4.2.6),  for  ^  >  1  we  can  find  the  posterior  mean  and  variance 

of  ^,    =    ^,(Ci'*,C|'')    =    PiCJ'*  +  P2C[''^  where  Pi  and  Pj  are  known  matrices 

.  The  Bayes  estimate  of  ^((C|'*,  c[^^)  under  any  quadratic  loss  is  its  posterior  mean 
and  is  given  by  '  •  .; 


eric^"^)  =    [p^+P2E{M,\c^^) 


,(1) 


Also, 


Note  that  when  Pi   =  ©^jPi,-  and  P2   =   ®-^iP2. 
where 


Pi.   = 


1^     ■• 
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and  ■    .'-    •     ■        .  :  ;:    /■ 

P2i    =     [0,    0,---,l^.,_„,,---,0] 

iti^t    iC'j    )  reduces  to  the  vector  of  finite  population  totals  for  the  m  small  areas 
at  time  j,    {j  =  l,2,---,t)  and  whereas  for  the  choice  Pi     =     ^^iPu/^ij  and 

-P2    =    ®'iLiP2i/Nij,  ^f{Cf    ,Cj  ^)  reduces  to  the  vector  of  finite  population  means 

for  the  m  small  areas  at  time  j,  (j  =  1, 2,  •  •  • ,  i). 

Also  for  t  >  2,  when  Pj   =   ©^jPi,  and  P2   =   e;iiP2.-  where 


Pi.   = 


0,  0,---,0,  - 


A^.(t-i)   "■^-'''  Nit   "" 


and 


^2.     = 


0,  0,---,0 


'    '      N^t-i)    ^•('-^-"•(.-1)'  jV^/a^..-".. 


^ti^t    r  C't    )  reduces  to  the  vector  of  change  in  mean  at  time  t  from  previous  time 
point,  for  the  m  small  areas. 

However,  in  order  to  find  the  conditional  distribution  of  Cp^  given  C^^  =  cp^ 
analytically,  we  need  to  find  the  posterior  density  /(A  |  cj^')  as  given  in  (4.2.4).  Even 
when  A  is  a  small-dimensional  vector,  it  is  not  easy  to  find  this  density  through  the 
use  of  sophisticated  numerical  or  analytic  approximation  techniques.  In  contrast,  this 
posterior  density  as  well  as  the  conditional  expectation  and  variance  in  (4.2.5)  and 
(4.2.6)  can  be  found  with  considerable  ease  in  a  more  general  case  when  tpj.,(j  = 
l,---,i)  and  W  are  unknown,  by  employing  the  Gibbs  sampling  algorithm.  It  is 
assumed  that  Vij  is  known.  In  such  a  case,  we  rewrite  the  model  as: 
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(I) 


ind 


9 


(1) 


«™  -^^^  UiM-""'^ 


»j  ;  ' 


where  V,j  = 


^tjll      ^012 


Vi 


ij21       y  tj22 


{i  =  l,---,m;j  =  1,  ••-,<) 


ind 


x\ 


o.,b,,R'^^  N  I  (    ':^^    )  a  +  I    ;|^,)    1  6„r-V,  1  ; 


"'J 


z< 


III.    b,\b,.„W,R'^'NiH,b,.ur-'Wy,{j  =  U...,t) 
We  assume  that  a,R,i}}-,---,  •^j,  and   W   are  mutually  independent  with 
a.      ~  Uniform  (R^), 
R      ~  Gamma(iao,  |/io)  ao  >  0,  /iq  >  0, 
V'j     ~  inverse  Wishart  {0,k^), 
W    ~   muerse  VFz^/iarf  (0,ko).  "' 

The  Gibbs  sampling  algorithm  requires  samples  from  the  following  complete  condi- 
tional distributions: 

•  b,\c['\c[^\e,R,a,xP^,---,tl;,br,---,b,.r,b,+r,---,bt,W,j  =  l,...,t 

•  ^i,-y,'^M'\c'i'\e,R,ot,bi,---,bt,W 

The  hierarchical  structure  of  the  model  and  the  conjugacy  of  the  priors  lead  to  the 
following  complete  conditional  distribution: 


56 


(0    a|cr>,ci'\^,r,6i,-..,6„'0i,---,V't,'W^ 
~  «|ce^\  ^,  r,  61,  •  •  • ,  6t,  V'l,  •  •  • ,  ?/»( 


K 


-1 


(ua)    ForO'  =  l,---,i-l)  : 

6j|cP',  c['^\  ^,  r,  a,  61,  •  •  • ,  6j_i,  6j+i,  •  •  • ,  6,,  Vi,  •  •  • ,  V't,  'W^ 
~6j|5,  r,  a,  61,  •  •  • ,  6j_i,  6j+j,  •  •  • ,  6^,  Vi,  •  •  ■ ,  ^i,  W^ 

-  iv,  [(Er=i  ^uV'7'^.7  +  ^J+i^-'^.+i  +  w-'Y' 
\:    X  {e:ii  ^JV'-^  (^^i  -  x,,a)  ;  '  , 

{lib)    6,|cp\  c!'\  tf ,  r,  a,  61,  •  •  • ,  6t_i,  t/^i,  ■  •  • ,  V',,  W 
~  6,1^,  r,  a,  61,  ■  •  • ,  6j_i,  ■01,  •  •  • ,  0^,  W 


A^„ 


: .  f  ri 


M* 


<  ^    i  ' 


•   V 


*      V** 
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'~    Inverse  Wishart  [rZT=i  i^ij  -  ^H^^  -  -^.j^i) 
X  (Bij  -  XijOL  -  Zi^bjf  ,  m  +  fcj] 

~  Inverse  Wishart  (r  Y.)=i  {bj  -  Hjbj.i ){bj-  H^bj^i  f  ,  t  +  ko) 

(v)    e,j\c\'\c\^\a,bu---,bt,iP„---,if;„W       \  ■,■ 


N 


.(2)|J1) 


(m)    y.yid  ',0,r,a,6i,.-.,6e,Vi,---,V't,^ 


7Y 


N,j-n,j 


^(2)    ,y.        ^-1     r„(l)_fl(in       V..,,V-1 


t    'i 


4.3     The  Hierarchical  Bayes  Predictor  in  a  Special  Case 


This  section  considers  a  special  case  of  the  HB  model  in  Section  4.2.  We  assume 
that  the  vector  of  the  ratios  of  variance  components,  A  is  known.  Our  target  is  again 
to  find  an  optimal  predictor  for 

for  some  appropriate  Pi  and  P2,  based  on  all  the  available  data  at  a  given  time.  To 

this  end,  it  suffices  to  find  the  predictive  distribution  of  Cj^^  given  C^^  =  c\^\  This 
is  done  in  the  following  theorem. 
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Theorem  J.S.I   Consider  the  model  given  in  4.2.1  or  4-2.2  with  A  known.    Assume 
that  a  and  R  are  independently  distributed  with 

a    ~    Uniform  (i?P) 

R    ~    Gamma  {^ao,lho);ao>0,  ho  >0 

Also,  assume  that  nt  +  ho-p>2.  Then  conditional  on  C^^  =  c[^\  CJ^^  is  distributed 

as  multivariate  t  with  degrees  of  freedom  rit  +  ho  -  p,  location  parameter  A/^cp'  and 

scale  parameter     ••  ,- *  ^  ^       >        s   'tv  » 

t 

«t  =  (nt  +  /io-p)~' (ao  +  ci'^^i<r,c!'')Gt        .'!  (4.3.7) 

Proof  of  this  theorem  is  similar  to  that  of  Theorem  4.2.1  and  is  omitted. 

In  this  case,  it  is  possible  to  obtain  the  Bayes  estimate  of  ^<   =  ^t{C[^\  Cp')  in 
a  closed  form  and  is  given  by 

ef(cS'^)   =   [P,+P,^(M.|cr')]cr>  (4.3.8) 

Remark  4.3.1  We  may  note  that  the  predictor  ef'{c\^^)  given  in  (4.3.8)  can  be 
obtained  without  using  any  numerical  integration  techniques  or  the  Gibbs  sampling 
algorithm.  ■-*.    , 

Remark  4.3.2  The  predictor  ef'{c[^^)  does  not  depend  on  the  choice  of  the  prior 
(proper)  distribution  of  R.  In  this  sense,  the  predictor  ef  "(cj^')  is  robust  against  the 
choice  of  priors  for  R.  To  see  this,  :    . 


V 
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E  {c?\&^^)  =e[e{e  (cPV,  R.  c^O  I^'  ^!'' }  l^-il 


It  is  assumed  that  all  the  expectations  appearing  above  exist. 

Remark  4.3.3  The  predictor  ef*(cpVf^t  =  ^tl^^^',  C*!^')  can  also  be  obtained  in 
an  alternative  manner.  Suppose,  first  a  known  and  r  may  be  known  or  unknown.  In 
such  a  case,  the  best  predictor  (best  linear  predictor  without  the  normahty  assump- 
tion) of  ^t{Ct  ,Ct  )  in  the  sense  of  having  the  smallest  mean  squared  error  matrix 
is  given  by      " 

Now,  when  a  is  unknown,  then  one  replaces  a  by  its  UMVUE 
(A/i''"2;-iA!''j'^Ae(^)''i;-lci''.   The  resulting  predictor  of  4,    =    ^,(c!'\c?') 
turns  out  to  be  ef '(cl^^).  In  this  sense,  ef*{c''^^)  is  also  an  EB  predictor  of  ^t(C'i^\  C'P^)- 

.     4.4     Best  Unbiased  Prediction  in  Small  Area  Estimation 

In  this  section,  we  prove  the  optimality  of  the  predictor  ef  *  within  the  class  of 
all  unbiased  predictors  of  ^^(CJ'',  Cp').    We  consider  the  model  given  in  4.2.1  or 
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4.2.2  with  A  known,  but  a  and  r  =  (cr^)"^  unknown.   We  do  not  assume  any  prior 
distributions  for  a  and  r  and  treat  them  as  unknown  parameters. 

Write  <?  =  (a^,  rf. 

Definition  4.4.1.        A  predictor  T{C[^^)  is  said  to  be  a  BUP  of  ^t(C[^\cp^)  if 
E^  [t{C['^)  -  UC['\  C|'^)l  =  0  for  all  <P  and  for  every  predictor  6{C['^)  of 


^,(C<'\ci'))  satisfying 

-  :\,  .    "■ .   ,  .       ■  ■•-      . 

%[6(C['))-^,(Ci'',Cp))]    =Oforall<P        '     '■     '  (4.4.9) 

V^  [5(Cp))  -  i,{C^^\  CP>)]  -  V^  [t{&^^)  -  U&^\  Ci^))]  (4.4.10) 

is  non-negative  definite  (n.n.d)  for  all  ^  provided  the  quantities  are  finite. 
The  BUP  property  of  ef  *  is  proved  in  the  following  theorem. 

Theorem  4-4-1   Consider  the  model  given  in  4.2.1  or  4.2.2  with  ct  and  r  —  [c"^)'^ 
treated  as  unknown  parameters  but  A  assumed  to  be  known. 
ef(Cp))   =   {P,^P,Mt)C^^^  istheBUPofUC\'\c\'^). 

The  proof  of  the  theorem  is  facilitated  by  the  following  a  general  lemma  which  is 
a  simple  extension  of  Lemma  3.3.1  of  Datta  (1990).  The  lemma  concerns  unbiased 
prediction  of  a  general  g{Ct)  {u  x  1)  based  on  c\^\  where  giCt)  is  not  necessarily 
equal  to  ^((C<  ,  Cj  )  or  linear  in  Cf  It  is  assumed  that  a  finite  second  moment 
exists  for  each  component  <7,(C<),  2  =  1,  •••,  u. 

Lemma  4.4. 1  A  predictor  T{C[^^)   E  Ug  is  BUP  for  g{Ct)  if  and  only  if 
\,       Cov^[T{C['^)-g{cW  m{C\'^)]    =  0  -  (4.4.11) 

for  all  g{Ct)  and  for  every  m  G  Uq,  Ug    =    Class  of  all  unbiased  predictors  6(Cp^) 
of  g{Ct)  with  each  component  of  6{C\^^)  having  a  finite  second  moment.    Uq  is  the 
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class  of  all  real  valued  statistics  (i.e.,  functions  ofC\^^)  with  finite  second  moments 
having  zero  expectations  identically  in  ^. 

Proof  of  Lemma  ff  ?    Lpf  T(n^^^)  ^.  IT^{n^^^)   . . .    rj^r-['))f        '. 
IfSiCi'^)  =  (<5i(Ci''),  ■  ■  •  ,,^„(Cf  *))^  is  another  predictor  in  Ug,  then 

V^  lTiC['^)-g{C,)]  +  V^  [5(Cp')  -  TiO] 

+Cov^  [r(Cr^)  -  giC),  5(Cp>)  -  TiO] 

+Cov^[6{C\'^)  -TiC,),T{C['^)  -  giO]  .      •     , 

Now  T,iC\'^)  -  6i(C\'^)   eUo  for  every  i  =   l,---,u. 
Hence,  using  the  condition  of  the  lemma,  for  1  <  i  <u, 


Cov^[T{C\'^)-g{C,),T,iCi'^)-6,{C\'^)]    =  0. 
From  (4.4.12)  and  (4-4.13)  it  follows  that 


^^[*(Cp')-ff(C,)]     =     V^\T{&^')-g{C,) 


+  V^\6{C^^^)-T{C,) 


(4.4.12) 


(4.4.13) 


(4.4.14) 


for  all  ^.  Hence  T{C[^^)  is  BUP  for  g{Ct). 

Only  if  Given  that  T(C|^')  is  BUP.  we  will  show  that  the  condition  (4.4. 11)  is  true. 
First  we  will  show  that  Ti{C\^^)  is  BUP  for  gi{Ct)  for  every  i  =  1,- ■  •  ,u.  Let  C/.(C|^') 
be  any  unbiased  predictor  for  gi[Ct).  Then  6*(C|^'),  a  u-component  vector  with  i  th 
component  equal  to  £/,(C|^')  belongs  to  Ug. 


Then 


Vr 


^ 


>(1) 


s*{cn-9{Ct] 


# 


T{C['^)  -  giC: 


,  /  V  '      if' 


^s      ?.  t 
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is  n.n.d.  by  hypothesis. 
So  we  have 


V^[U,{Ci'^)-g,iCt)]-V^[T,iC\'^)-g,iC,)]    >   0 


and  consequently  Ti{C\^^)  is  BUP  for  gi{Ct). 

Now  following  the  usual  LehmannScheffe  (1950)  technique  (also  Rao,  1952),  for  any 

m  e  Uo 


Cov^  [r.(Cp^)  -5.(C,),  m{C['^)]  =  0 


(4.4.15) 


for  all  I    <   i   <u.  Hence,  (4.4- H)  holds,  and  the  proof  of  the  lemma  is  complete. 

Remark  4.4. 1.     It  follows  from  the  proof  of  the  above  lemma  that  the  BUP  of  g{Ct) 
is  unique  with  probabiHty  one. 

Proof  of  Theorem  i.A.l  In  view  of  Lemma  44-h   it  suffices  to  show  that  for  every 

m(Ci^))  €  Uo, 

Cov^[ef*{&^^)-i,{C\'\cf\  miC^"^)]    =  0  for  all  ^. 


that  is  %  [Pi(M,C|''  -  C[^^)  m(C\'^) 
Since,  under  the  model,  '.  '^     i\-     "s? 


0  for  all  <?. 


.(1) 


using  E^[miC\  ')]  =  0,  it  suffices  to  show  that 


'<^ 


(A.(^)^i;-lcr')m(ci^') 


=  0 


(4.4.16) 


(1)^ 


for  all  ^.  Since  E^[m{CY^)]  =  0 
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^  |m(C<^')exp[-ir(ci^>-Ai^)arJC-\(ci^>-A<^'a)]c/cr>   =  0,   (4.4.17) 
differentiating  both  sides  of  this  equation  w.r.t  a,  one  gets  (see  p  318  of  Rao,  1973) 

xexp  [ir(cp)  -  A\'^afE;,\{c['^  -  Ap^a)]  dc\'^  (4.4.18) 

*  r  '  .       ^ 

=  0. 

Using  E^[m{c\^^)]  =  0  again,  (44.16)  follows  from  (44.18).   The  proof  of  Theorem 
is  complete.  V 

Remark  4.4.2.  We  can  also  prove  Equation  (4.4.16)  in  the  following  way.  Note 
that  since  C|^)  ~  iV  (Ap)a,  r-'Eni),  (^At^'^^ E;,\c\'\  C\'^'' K,ci'^^  is  complete 
sufficient  *.  Hence  A/^'   i;~i\c[^^  must  have  0  covariance  with  every  zero  estimator 


m  (Cp)),  i.e.,  E^  \{A,^'^'' i:;,\c\'^)m{C['^) 


=  0. 


Next  we  generalize  Theorem  4.4.1  for  certain  non-normal  general  distributions. 
Suppose  that 

A      =      Block  Diagonal  (Bt(It  ®  W)BJ,  Im®^t  +  V^) 

Assume  that  given  R  =  r,  e*  ~  iV(0,  r-'^A),  while  marginally  the  distribution 
function  of  R  is  an  arbitrary  member  of  the  family  J"  =  {F  :  Fis  absolutely  continuous 
with  pdf  /(r)  =  0  for  r  <  0}.  Let  J"*  denote  a  subfamily  of  JT  such  that  each 
component  of  ef '(CJ^')  and  ^^(Ci^',  C|^')  has  finite  second  moment  under  the  model 
4.2.1  and  the  joint  distribution  of  e*  and  R.  In  such  a  set  up,  we  now  prove  the 
BUP  property  of  the  predictor  ef '(CJ  )  in  the  following  theorem  which  is  a  simple 
extension  of  the  Theorem  3.3.2.  of  Datta  (1990). 


'^^'^■i^ 
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Theorem  IJ.2  ef*(C|")  is  BUP  o/^j(Cp\  CJ'^)  under  the  model  in  4.2.1,  e'\R  = 
r  ~  iV(0,  r-'^A)  and  R  has  a  df  from  T* . 

Proof  of  Theorem  4-4-^  ■     Using  lemma  4-4-i  o,nd  following  the  proof  of  Theorem 
4-4-i)  it  suffices  to  show  that 


Ea,F  [(A/^)^I7-l<^r  V(C|^')' 


for  all  a  6  i?"  and  for  all  F  6  T' ,  where 


0 


(4.4.19) 


Eot,F  [m{C^'^) 


=      0 


(4.4.20) 


and  Eoi,F  m?{Ct)^    <   oo  for  all  a  and  for  all  F  e  T* .  Consider  the  subfamily 

H={Rr^  gamma{\c,  iJ)  :  c>  0,  </  >  2}  of  T\  *  '  '^- ' 

Since  (4.4-20)  holds  for  this  subfamily  H,  Ea,F  [m(C|^')]    =   0  for  all  a  and  for  all 
F  E  7{  gives 


/o~exp(-icr)/r5(-+'')-iexp[-ir(ci^)  -  AS^)a)''i;-l(cp)  -  Ai'^a) 


(4.4.21) 


xmic[^^)dc[^Ur  =  0 


for  all  a,  c  >  0  and  d  >  2.  Now  using  the  uniqueness  property  of  Laplace  transforms, 
it  follows  from  (4.4-21)  that 


J 


r2 


i(n,+d)-l 


exp 


r    1 


^r(cr»-A|^'a)-i;-l(cr)-Ar)a) 


xm(cS'Vci'^      =     0 


a.e.  Lebesgue  for  all  r  >  0  and  all  a,  i.e., 


I 


exp 


ir(cr'-Ap)a)"i;-l(cr'->lPM 


xm{c\^^)dc\^^     =     0 


(4.4.22) 
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a.e.  Lebesgue  for  all  r  >  0  and  all  a. 

Differentiation  of  both  sides  of  (^.4-22)  with  respect  to  a  and  some  simplifications 

using  (4.4-22)  lead  to 


/(A/^)^i:-lcrV(cp^) 


xexp 


-ir(cP)  -  a[^We;M4'^  -  Ai^>a) 


dc\'^  =  0 


(4.4.23) 


a.e.  Lebesgue  for  all  r  >  0  and  all  a.  '.      t% 

Multiplying  both  sides  of  (4.4-23)  by  rs"'  and  integrating  with  respect  to  dF{r)  where 

F  eJ^\  one  gets  (4.4.19). 

Remark  4.4.3.       Since  J^'  does  not  contain  the  degenerate  distributions  of  R  on 

(0,  oo),  Theorem  4.4.1  does  not  follow  from  Theorem  4.4.2.  ^ 

Remark  4.4.4.       In  Theorem  4.4.2,  if  we  take  H  for  JF*,  we  see  that  the  marginal 

distribution  of  C«  is  given  by  the  family  of  distributions 

{7{-  \NuAtCx,  {cld)Eud)  :  a  e  /?P,c>  0,-(/  >  2}  and  ef*{c^^^)  is  BUP  for 

^((C|^',C^^')  for  this  family  where  ^(-  \Nt,Ata,  {c/d)IJt,d)  is  A^j-variate  t-distribution 
with  location  parameter  Atoc,  scale  parameter  (c/d)IJt  and  degrees  of  freedom  d. 
Next  we  dispense  with  any  distributional  assumptions  in  (4.2.1)  and  show  that  ef"(c|^') 
is  the  BUP  of  ^ti^t  ,  C<  )  within  the  class  of  all  linear  unbiased  predictors.  A  pre- 
dictor 5(Cp*)  is  said  to  be  linear  if  6{C[^^)  has  the  form  X(C|^*  for  some  known 
uXTit  matrix  Lt.  If  in  addition  %  [5(C|^')  -  ^t{C\^\  C[^^)]  =  0  for  all  ^,  we  say 
that  6{C[^^)  is  a  LUP  of  ^^(CJ^*,  Cp^).  We  now  introduce  another  definition. 


Definition  4.4.2.  A  LUP  StC[^^  of  ^t{C[^\  Cp')  is  said  to  be  a  BLUP  if  for  every  LUP 


is  n.n.d.  for  all  <?. 


V^ 


fs.cP^-e,(c|^cP) 
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We  now  prove  the  BLUP  property  of  ef*(c!^^)  in  predicting  ^j(c!^',Cp^)  in  the 
following  theorem. 

Theorem  i.4.3  Consider  the  model  in  (4.2.1)  and  assume  that  E^{ui)  =  0,  E^{tj)  = 
0,  E^{u})  =  0,  E^iijuj^)  =  0,  E^irj^rj)  <  00  and  E^{u;^u;)  <  00.  Then  e^*{C[''^) 
is  the  BLUP  of  ^t{C\'\c[^^). 

Proof  of  this  theorem  requires  the  following  lemma. 

Lemma  A.4.2  A  LUP  StC\^^  of  ^^{C\^\c[^^)  is  a  BLUP  if  and  only  if 


Co.^[s.Cr^-e.(Cp',Cp'),  m^cW 


=   0 


for  all  <P  and  for  every  known  Ut  x  I  vector  m  satisfying  E^{m^Ct)    =    0  for  all 

Proof  of  this  lemma  is  similar  to  the  proof  of  Lemma  4.4.1  and  is  omitted. 

Proof  of  Theorem  4.1.^  If  %(m^ci'')  =  m^^AJ'^a  =  0  for  all  a,  m^Aj''  =  0^. 
Hence 

=     P2{MtStu-St2i)m  ■.;■;■• 

=      0 

for  all  $.    The  last  two  equalities  follow  from  the  definition  of  Mt  and  from  the  fact 
that  rr\^A\      =   0^ .  Applying  lemma  4-4-2,  the  result  follows. 

Remark  4.4.5.     It  can  be  easily  proved  that  BLUP  oig{Ct)  is  unique  with  probability 
one.      .     ,     ,  t .    ?  ,  ■■  ' 


F..      r--:-   /..     •    •        ^  ■■... 
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4.5     Stochastic  Domination     '     • 

This  section  examines  the  stochastic  dominance  (as  in  Hwang  (1985))  of  the  HB 
predictor  ef '(ci^^)  within  the  class  of  all  linear  unbiased  predictors  of  ^<(Cf  \  Cp^). 
We  will  consider  the  model  of  Section  4.2  with  A  known.  As  in  the  previous  Section,  we 
will  treat  a  and  R  as  unknown  parameters.  Also,  we  write  e*  =  (u;-^,  t;-^)^  where  w 
and  T]  are  as  defined  in  Section  4.2.  We  dispense  with  the  normality  assumptions  on  u; 
and  rj.  In  the  previous  section,  we  established  the  risk  optimality  of  the  HB  predictor 
ef*{cl^^)  within  the  class  of  all  LUPs  of  ^t{C\^\   CJ^^)  without  any  distributional 

assumptions  on  e*.   The  optimality  of  ef'{c'i^^)  within  LUPs  holds  a  fortiori  under 
the  quadratic  loss 

=     {6  -  ^f  f2{6  -  Ct)  ;-•; 

=     tr[f2Loi^,,6)],  ■;'._ 

where  i?  is  a  n.n.d.  matrix.  We  shall  refer  to  such  a  loss  as  generized  Euclidean  error 


w.r.t.    O.    Let  Rl{^,^^-6)    =    E, 


$ 


5(cp^)-^,(c<^\cp)) 


2 


be  the  risk 


function  of  the  predictor  6  for  predicting  ^^  under  a  loss  function  which  is  a  function 
of  generalized  Euclidean  error  w.r.t.  H  for  some  function  L. 

We  now  assume  that  e*  has  an  elliptically  symmetric  distribution  with  pdf  given 

by  t      ..'  :     '     ^y  -   ^     '.     ^  i  .'  '..  '  '^ 

-'-  h{e*\A,r)     ex      \r-' a\~' f{re'^ A-'e*)  /  (4.5.24) 

A     =     Block  Diagonal  [Bt{I  ®  W)BJ,  (/„  0  tf^,)  +  Vt) 
and  the  known  non  negative  function  /  is  such  that 
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/ 


t       q  m      t     Ni] 

EEK-^l  +  EEEd^.-.^l  +  k-.^D  +  i 


'T  y^-l 


f{re*'  A-^e')de'   <   oo  (4.5.25) 


w 


here 


We  denote  this  distribution  by  Qj{0,r-'^A)  where  '^f{fi,f2*a'^)  denotes  the  distri- 
bution  whose  pdf  is  given  by  ■  '  [ 


^  I  -, 


k{a\fj,,0',a)  oc 


a^f2' 


■■f{{s-fxfQ%s-fjL)/a') 


(4.5.26) 


where  s  and  /i  are  in  R'',  ^'{p  x  p  is  p.d.  and  a  >  0. 

Note  that  the  normaUty  of  e  with  mean  0  and  variance  covariance  matrix  r~^  A  is 

sufficient  but  not  necessary  for  (4.5.24)  and  (4.5.25)  to  hold. 

It  follows  from  (4.5.25)  that  £(e')  exists  and  from  (4.5.24)  E{e*)   =  0 

Note  that  e"  =  r'^A'^e*  has  a  spherically  symmetric  distribution  Q=/(0,  /,.)  with 

characteristic  function  (c.f.)    E  exp{iu^ e")      =    c{u^u)  for  some  function  c  (see 

Kelker,  1970)  where  z   =   \/^,«  =   («i,  •  •  • ,  u,.)^  and  g*   =   Nt  +  tq. 
Hence  e*  has  c.f.  given  by  : 


E  [exp(m-^e')|    =   c{r~^u^Au) 

Now  write  D*j   =  Bt^^^w  +  e^^^    {j  =  1,2) 
Then  D;   =   (£>;/,  D^/f  has  a  c.f.  given  by 


(4.5.27) 


E 


.T, 


exp{iu'^  Dt") 


.-l^.'T 


c{r-^u'' Utu') 


(4.5.28) 


where 


St     =     B,{I®W)Bj  +  iI^0^t)  +  Vt 
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Comparing  (4.5.24)  and  (4.5.27)  one  can  see  from  (4.5.28)  that  D'  has  also  an  eUip- 
tically  symmetric  distributions  %(0,  r~'^St)  with  pdf  given  by    - 

h{d*\Et,r)    <x      \r-'Et\~'f{rdfE;'d;)  .-    •  (4.5.29) 

We  now  introduce  the  notion  of  "stochastic"  or  "universal"  domination  as  given 
in  Hwang  (1985). 

Definition  4.5.1.  An  estimator  6i{C[^^)  universally  dominates  S^iC^^^)  (under  the 
generalized  Euclidean  error  w.r.t  O)  if  for  every  <P,  and  every  nondecreasing  loss 
function  L,  Rl{^,  ^j;  5i)  <  Rl{^,  ^^■,  6i)  holds  and  for  a  particular  loss,  the  risk 
functions  are  not  identical.  •'-",!'.     , 

In  Hwang  (1985),  it  has  been  shown  that  6i  universally  dominates  62  under  the 
generalized  Euclidean  error  w.r.t  !7  if       .  >  . 


i'i  , 


•     ^.(cr')-^.(ci^ci^>) 

is  stochastically  smaller  than 

^2(ch-^.(cP',  cP>) 


2 


n 


The  following  theorem  estabhshes  the  stochastic  dominance  of  ef  "(CJ^')  over  every 
LUP  of  ii{Ci    ,  Ct    )  for  a  general  class  of  elliptically  symmetric  distributions  of  e". 

Theorem  4-5.1  Under  the  model  given  in  4-3.1  with  A  known  and  assuming  e'  has  an 
elliptically  symmetric  pdf  as  in  (4.5.24),  ef''{C\^^)  universally  dominates  every  LUP 
6{&^^)   =   StC\'^  of^,{C['\  CP^)  for  every  p.d.  f2. 

To  prove  this  theorem,  we  require  the  following  lemma. 

Lemma  A. 5.1  If  D{Nt  x  1)   has  pdf  h(d\I^^,r),    then  for  every  L{u  x  Nt),    u   < 

Nt,  LD  =   [L  L^Y^Du   =   (J„,  0)D  where 

0  (u  X  {Nt  —  u))  is  a  null  matrix  and  =  means  equal  in  distribution. 
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Proof  of  Lemma  1.5.1  .     The  -proof  follows  the  arguments  of  Hwang  (1985).    From 
(4.5.27)  it  follows  that  D  has  c.f.  E  [exp  [is^D)]  =  c  (r'^s'^s).  Hence 


E[exp{isjLD)]     =    c{r-'sjLL^s,) 
where  Si  is  a  u  x  1  vector.  Next  using  (4.5.30) 

J    "  v.. 

■    ■     E^exp(^isJ[LL'')'Du^]  ■    - 

.  •   "■      •  ••■■-  •  ■'  -     ,■        ti 

=     E  [exp  [isl [LL^f  {I^O)d\  '■ ' '  ^^'^  '^ 

=      c(r-i5f(££^)'(J„0)(/„0f(XL^)'5i) 

=     c{r-^s^LL^)s,), 
so  that  the  lemma  follows  from  (4.5.30)  and  (4.5.31).      ■ 
It  follows  as  a  consequence  of  Lemma  4.5.1  that 


D^L^LD     = 


LDf{LD) 


DlL  L^D, 


We  shall  use  (4.5.32)  repeatedly  for  proving  Theorem  4.5.1. 


(4.5.30) 


(4.5.31) 


(4.5.32) 


Proof  of  Theorem  4.5.1  Let  StC\^^  be  a  LUP  of  $,^{&^\  CJ^^).  Then,  under  the  time 
series  model  (4.2.1)  with  A  known  and  from  the  fact  that  E{e*)  =  0,  it  is  easy  to 
see  that 


{St-PMr  =  P2A\ 


(1) 


I  (2) 


(4.5.33) 
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Writing  D  =  Et^D*  and  using  (4.5.32)  and  (4.5.33)  one  gets 

=  D^Ei[St-Pi  -P2fn[St-Pi  -P2]s}d 

=  Djf2^[St-P,    -P2fSt[St-Pi    -P2]n^D^. 
Similarly, 

=  D'^[P2Mt     -P2fn[P2Mt     -P2]D' 

=  D^s\[P2Mt    -P2Yn[P2M.t    -P2]e\d 

=  Djfi'^P^Mt    -P2\St[P2Mt    -P2]^i7^r>„. 


Write  r   =   St -Pi-  P2Mt.   Th 


en 


rhs  of  (4.5.34)=rhs  of  (4.5.35)+DJ O^ rSniP^  fi' D^ 


since  using  the  definition  of  Mt. 


{P,Mt    -P2)Et 


0 


P2(M<  -/)(  ^''^  ]r'' 


>t21 


P2{MtEai    -IJt2i)r^ 

P2  K'  -  i:..i:nUr')  (Ap)"i;-Ui^')"Ar'V 


and  using  (4-5.33), 


(4.5.34) 


(4.5.35) 


(4.5.36) 


(4.5.37) 


rA\ 


(1) 


{St-Pi-P,Mt)A\'^  /    ■ 


'  ..  '.  * 


72 


=      0  _  (4.5.38) 

Theorem  (4.5.1)  follows  now  from  (4.5. 34)-(4. 5. 36). 

Also  since  Et\\  is  positive  definite,  it  follows  from  (4.5.36)  that 

rhs  of  (59)  =  rhs  of  (60) 

if  and  only  if  T  =  0,  that  is  St  =  Pi  +  P2Mt. 

4.6     Best  Equivariant  Prediction 

In  this  section  we  concentrate  on  the  equivariant  prediction  of  ^^(Cf^Cp^)  on 

the  basis  of  C^     under  suitable  group  of  transformations.  Assuming  A  as  known,  we 
rewrite  the  model  in  (4.2.1)  as 

■      Ct^  N{AtOt,  r-'St)  (4.6.39) 

where  •  .  •  . 

^   ,__    i:t  =  BtiI®W)Bj  +  (I^0^t)  +  Vt  .....■;..  (4.6.40) 

We  redefine  tp  —  [a  a^)        -  '  >' 


t 


We  are  interested  in  equivariant  estimation  of  Pi  Ai^^a  +  P2A.p'a  where  the  matrices 

■^t    ■>  At    ,  Pi  and  P2  are  as  introduced  earlier  in  Section  4.2.  It  suffices  to  estimate 

the  vector  ct  where  AtCt  can  be  viewed  as  a  location  vector. 

Note  that  Ct  +  Atl3  ~  N{At{a  +  /3), a^Et) 

We  are  interested  in  developing  an  estimator  of  a  which  remains  equivariant  under 

a  new  origin  of  measurement.  So  a  "natural"  group  of  transformations  will  be 

Qi  =  {g(3,(3^W  ■.gi3{Ct)  =  Ct  +  Atp]         ■.     ■■       ;.  '  (4.6.41) 
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Now  assume  that  we  partition  Ct  and  At  as  V 


Ct={    J2,      ,A,= 


If  <5(C<'0  estimates  (PjAp'  +  P2Ai")a  then  (5(Cp^)  +  PiA^^/?  +  PjAJ'^/?  should 
estimate 

(Pi a1'^  +  P2A|'')(a  +  /3)  =  (Pi   P2iA,(a  + /3) 

Treating  At(a+/3)  as  the  new  location  parameter,  one  can  expect  that  (5(C|^'+Ap^y3) 
(estimator  based  on  the  "observed  part"  of  Ct  and  At)  will  estimate 

(Pi Af )  +  P2Ap^)(a  +  /3)         F  !  -^  ^  >  V 
So  we  should  have 

6(cS^^  +  Ap>/3)  =  6(Cp')  +  PiA|^'/3  +  P2Ap'^  (4.6.42) 

for  all  Cf  \/3.  Now  if  we  are  interested  in  estimating  ^<(C|^\  C[^^),  instead  of 
,         £:^(e.(Ci^ch  =  P,Ar'a  +  P2A|^'a 

We  can  still  use  S{C\^^)  and  again  we  will  impose  (4.6.42)  on  S. 

Note  that  the  induced  group  of  transformations  on  the  parameter  space 

^  =  {<l>  :  <P  =  {a,  a^f,  a  e  i?^  <7  >  0}  '  (4.6.43) 

is  given  by 

^i^{9p,PeR'':g^{<^)  =  {a,a''  +  f3^f}  (4.6.44) 


A  loss  function  L{it,(j)\6)  is  said  to  be  invariant  for  predicting  ^t(Cj^\c!^')  by 
5(C|^))  if  ' 

=  L(^,(Cj,ci''),(^;5(Cr)))  (4.6.45) 

for  all  Cj    ,Ci    ,/3,and(/).  An  estimator 5  satisfying  (4.6.42)  is  said  to  be equivariant. 
We  will  now  be  interested  in  the  best  equivariant  prediction  of  -     ^ 

The  following  lemma  provides  a  useful  characterization  of  the  class  of  equivariant 
predictors  of  ^,(Cp\Cp>).         .  .■-f-'-.V^, 

^e"^"^Q  ^.g.j  .    Let  ^o(C|^^)  6e  an  equivariant  predictor  for  ^^(Cp\c|^').    T/ien  a 
necessary  and  sufficient  condition  for  a  predictor  5(Cj^')  to  he  equivariant  is  that 

:^   «5(c!^')  =  <^o(Cp')  +  h(Ci^^)  ■  -:  (4.6.46) 

for  all  Cp\  where  "" 

h(c!^)  +  Ar)/3)  =  h(ci^')  (4.6.47) 

/or  all  &^^  and  all  (3.  •    .        ' 

A  function  h  satisfying  (4.6.47)  is  said  to  be  invariant  and  hence  must  be  a  function 
of  a  maximal  invariant  function  under  the  group  of  transformations 

Q'i  =  {9'f3.i3eR':g'f^{&^^)  =  &^^  +  A^^^I3^  (4.6.48) 


^,- 
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induced  by  Qi  in  the  C^    -  space.  The  next  lemma  finds  a  maximal  invariant  function 
under  the  group  of  transformations  ^( . 

Lemma  4-6.2  A  maximal  invariant  function  under  the  group  of  transformations  Q[  is 
KtC^^"*  where 

Proof  of  Lemma  4. 6.1  First  we  show  that  KtC[^^  is  an  invariant  function.  Note  that 
and  hence  i        '     ^'»      *''    '        '--■  w> 

for  all  Cf     and  j3.  Now  we  will  prove  the  maximality. 
ForC|J\c|;>  such  that 

we  have  Cy^  -  CJ^^  is  orthogonal  to  the  row  space  o/K<. 
Also  rank  (K^)  -  Ut  -  p  and  rank  {A\^^)  =  p. 

Again  using  KfAf'  =  0.  it  follows  that  the  column  space  o/A^'  forms  an  orthocom- 

plementoftherowspaceofKt. 

Hence  CJ;'  -  clj'  =  a|''/3  for  some  (3,  i.e., 

proving  thereby  the  maximality  of  KtC[^\ 

Since  ef*(C|^')  is  an  equivariant  predictor  o/^t(C|^\  Cf),  using  lemmas  4. 6.1  and 

4.6.2  we  can  characterize  the  class  of  all  equivariant  predictors  o/^<(C|^\  C\^^).   This 
is  given  in  the  next  lemma. 
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Lemma  1.6.3  Under  the  group  of  transformations  Q\  given  in  (4.6.4I),  a  predictor 

S{Ct  )  of^^[Cf  ,Ct  )  is  an  equivariant  predictor  if  and  only  if  6  has  the  represen- 
tation 

6{C['^)  =  ef  (Cp))  +  1{K,C[''>)  ..  (4.6.49) 

for  all  Cf    ,  where  I  is  an  Ut-  variate  vector  valued  function. 

Define  the  loss  functions  -.'       • 

Loi^u  6)  =  {6-  Ct){6  -  ^tf,  Ii(^„  6)  =  trinLoiCt,  *)],  (4.6.50) 

where  f?  is  a  n.n.d.  matrix. 

Note  that  Lq  and  Lj  are  invariant  loss  functions. 
Definition  4.6.1.    An  equivariant  predictor  6o{C[^^)  of  ^j(Cp',C|^^)  is  said  to  be  a 
best  equivariant  predictor  under  the  loss  Lq  if  for  every  other  equivariant  predictor 

■  E4>[Lo{^u6)-Lo{^„6o)] 

is  n.n.d. 

Note  that  if  6o{C\^^)  is  the  best  equivariant  predictor  of  ^t(Cp\  C|^')  under  the  loss 
Lq,  then  it  is  so  under  the  loss  Li  for  every  n.n.d.  17.  Conversely,  if  6o(Ct)  is  the 
best  equivariant  predictor  oi  ^^{C\^\c\^^)  under  the  loss  Li  for  every  n.n.d.  f2,  then 
it  is  so  under  Lq.    Next  theorem  establishes  the  optimality  of  ef'iC*^^)  within  the 

class  of  all  equivariant  predictors  S(Ct)  of  6o(C(  )  under  the  loss  Zq,  and  a  fortiori 
under  the  loss  Li  for  every  n.n.d.  17. 

Theorem  4-6.1  Under  the  normal  time  series  linear  model  given  in  (4-2.1),  the  group 
of  transformations  Qi  given  in  (4-6.41)  and  the  loss  Lq  given  in  (4-6.50),  the  best 
equivariant  predictor  o/^^(C|^',  CJ^')  is  given  by  ef'{C\^^). 


^ 
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Proof  of  Theorem  1.6.1  Note  that  from  (4-6.49)  any  equivariant  predictor  6 {C]    )  of 
^ti^t^Cf)  is  given  by 


'(l)^  _  „B»fr^(i) 


(1)^ 


s{cr')  =  er{cn+iiK,cn 

Only  those  8  are  to  be  considered  for  which  £^^[Z/o(^t,^)]  is  finite. 
Note  that  EALQ{^^,ef'')]  exists  finitely  and  hence  ' 

£;^[F(K,Cr))l(K,ci^')]  <  2tr  {e^[U{^,,  ef )]  +  E^[Lo{i,,  6)\] 

Hence  by  Cauchy-Schwarz  inequality  J5^[l(KtC|^')F(KeC^^^)]  is  finite. 
Under  the  matrix  loss  Lq,  :       '  >.-. 


E^[Lo{^,,6)-Lo{^„ef*)] 


(4.6.51) 
(4.6.52) 

Now  we  will  show  that  the  last  two  product  terms  are  null  matrices.   Using  the  defini- 

'  '.    * 
tions  ofef*{Ct)  ^.^d  Mt,  we  have  '. ..'  '  ']:.  >  ' .,  ,.    .    . 


£;^[(ef(cr>)-e,(cl^cr')|C<^) 


=   P7 


[M.c|^'-£;^(cP'lcr))] 


=  P2  [MtC\'^  -  A^'a  -  i;m-^rn  (CP*  -  M'^ct)]  a.e.  Leb. 


=  P, 


Ar^-i:..:^-^nU'^'^i^nur^ 


-1 


xU(^)"i;-\cr»-A/^)^i;-Ui^'a 


(4.6.53) 
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Now 


Gov 


<t> 


A.^^y  e;,\&^\k,c\ 


(1) 


=    0 


Since  C/'  is  multivariate  normal  and  At^^^  iT^iicI^'  andKtC\^^  are  linear  functions 
ofCt  ,  so  At^S'^Ct  and  KtCt'  are  independently  distributed.  Now  using  (4.6.53) 
and  independence  o/A/*'   177i\c|'^  and  KtC\^\  we  have 


=  E^  [E^  {ef  (C^O  -  ^,  (CP\  CP))  |C!^^}  F  (/^.Cf)) 


xE 


</> 


A/^)"i;-\cp'  -  A,(^)^i;-\^r^«)  1"  {K,c['')] 


^0 


Now  from  (4.6.52)  and  (4.6.54) 


E^  [Lo  (e.  s)  -  Zo  (^e,  ef  •)]   :^  2=;^  [I  [k,&^^)  F  (i^.ci^))] 


>    0 


with  equality  if  and  only  if 


(4.6.54) 


(4.6.55) 


p^[i(i^.cr>)]    =   0, 


The  proof  of  theorem  is  complete. 

Next  instead  of  Qi  we  consider  the  group  Q2  oi  transformations  given  by 

g2  =  {9i3^^,(3eR'',d>0:gf^JCt)=dCt  +  At(3}  (4.6.56) 

A  predictor  5(C|^')  of  ^j(Cp*,Cp^)  is  said  to  equivariant  under  the  group  ^^2  of 
transformations  if 

6{dC['^  +  Ap^/3)    =    dSiCl'^)  +  (Pi  Ap)  +  P2  Ap))^  (4.6.57) 

for  all  C\^\l3  and  d  >  0.    Note  that  ef''{C[^^)  is  again  an  equivariant  predictor  of 

^ti^t  1  ^t  )  under  the  above  creterion.  Also,  note  that  if  6  is  equivariant  under  Q2 
taking  c?  =  1  in  (4.6.57)  it  follows  that  it  is  equivariant  under  Qx.  So  the  class  of 
equivariant  predictors  under  Q2  is  smaller  than  that  under  Qi.  But  we  will  prove  that 
^t  *{^t  )  is  best  inside  this  smaller  class  under  a  larger  family  of  distributions  of  Ct 
which  includes  the  normal  distribution  as  a  special  case.  We  will  apply  the  group 
of  transformations  Qi  on  the  family  of  elliptically  symmetric  distributions  with  pdf 
given  by 

:    .    .    f^{c,)<x\a^Et\-'y{{c^-AtOcYi:j\ct-AtOc)la^  "  (4.6.58) 

where  f  is  a  known  non-negative  function  satisfying 

'     j  [l  +  [ct  -  Atocf  E;\ct  -  Atoc) 

xf{{ct-Atafi;;\ct-AtCt)/a^)dct<  00  (4.6.59) 

Note  that  Ct  ~  N{Ata,  a^Ht)  is  a  special  case  satisfying  (4.6.58)  and  (4.6.59).  It  is 
easy  to  see  that  the  above  group  of  transformations  Q2  on  iVt-dimensional  Euclidean 
space  for  Ct  induces  a  group  of  transformations 


■i      a2  =  {y^,,,/3Gi?^J>o:^^,  =  (rf(r,da^  +  ^^f}  (4.6.60) 

on  the  parameter  space  given  in  (4.6.43). 

As  before,  a  loss  function  L{^f,(f)]6)  for  predicting  ^t{C\    ,  Cj  ')  by  6{C\  ')  is 
invariant  under  the  group  of  transformations  ^2  if 

-Qi.    =L{^t{C'i'\c'^^),cf>-6{C^^))         '        3.  A,         '  (4.6.61) 


•■■   r,-  ■/   t.. 


for  all  Cp\ci^\/3,d(>  0)  and  </). 
We  shall  consider  the  special  losses 

'   ,      L2iCt,<l>;S)  =  Lo{^„6)  /a'  ^  (4.6.62) 

and 

L3it,4>;6)  =  LrUu6)/a'  (4.6.63) 

Both  these  losses  satisfy  (4.6.61). 

To  find  the  best  equivariant  predictor  of  ^t{C\  ,0]  )  in  this  set  up,  we  will 
present  two  lemmas  which  provide  a  useful  characterization  of  the  class  of  equivariant 
predictors. 

Lemma  4-6.4  LetSoiC]  )  be  an  equivariant  predictor  for  ^^{Ct  \C\  ).  Then  a  nec- 
essary and  sufficient  condition  for  a  predictor  6{Cf  )  of  ^f{C[  ',Ct  ')  to  be  equiv- 
ariant is  that 

•   :  5(Cr')  =  5o(C|^')  +  /i(C|^')  ,.:..  (4.6.64) 

for  all  Ct    ,  where 


I  ^_.  =J      -V" 
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•.      h{dC'i'^  +  A\'^^)  =  dhic\'^)  (4.6.65) 

for  all  C\^\l3  andd>0. 

We  will  follow  the  arguments  of  Lemma  2  and  3  of  Datta  and  Ghosh  (1988)  to  find 
a  representation  of  the  function  h  in  (4.6.65).  Define 

.        p(li:,cl^')  =  iir,C|^V(Cp>'iir,cl^')^/^<,,r^^^o,^^j  (4.6.66) 

Note  that  since  KtSniKt  =  Kf,  so 

and  p  is  indeed  a  function  of  KtCt  ■  It  can  be  shown  that  p{KtCt  )  is  a  maximal 
invariant  under  the  group  of  transformations 

^2  =  {y;^,.'/^  ^R\d>o:  g'^,{C^^^)  =  {d&^^  +  Ai^^)/3}  (4.6.67) 

induced  by  Q2  in  the  C^    -space. 

The  following  lemma  characterizes  the  class  of  functions  h{Ci)  satisfying  (4.6.65. 
We  will  use  this  lemma  to  characterize  the  class  of  equivariant  predictors. 

Lemma  i.6.5  A  function  /i.(Cj  )(u  x  1)  satisfying  (4-6.65)  if  and  only  if  h  has  the 
representation 

■   ;    h{C'i'^)  =  {Ci'^^K,c\'^)h{p{KtC['^))  (4.6.68) 

where  s{u  x  I)  is  an  arbitrary  function  ofp{KtCt). 

Proof  of  Lemma  4-6.2  Assume  h  has  the  representation  given  in  (4-6.68).  Now,  since 
(c?Ci"  +  A^^^(3fKt{dC^^^  +  a|''/3)  =  ^2(<.(i)^2ir,Cp'  and  since  p{Kt&^^)  is  a 
maximal  invariant  under  Q'2,  so  p{K t{dC^^^ -\- A^^^ i3))  -  ^(/i'fCf  ^)  and  consequently 

h{d&^^  +  A^^^  0)    =    {d'C['^'K,C\'^)h{p{K,C'i'^)) 


'  t 


c,r 


«    ■ 
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Hence  (4-6.65)  is  satisfied,  only  if.  Since  h  satisfies  (4-6.65)  for  all  c]    ,/3  and  d  >  0, 

taking  d^l,  we  see  that  h  must  satisfy  /i(C|^*  +  A\^^f3)  =  h{C[^^)  for  all  C[^^  and 
f3.  This  implies  that  h  must  be  invariant  under  Q[,  and  hence  must  be  a  function  of 
KfCt    .  So  h{C^    )  =  s{KtCt    )  where  s{u  x  1)  is  an  arbitrary  function  satisfying 

i.e.s{dKtC\''')^ds{KtC['^)  (4.6.69) 

for  all  d  >  0.  Now  taking  d  =  {C\^^^ KtC[^^)-2  for  all  C[^^^ K tC[^^  >  0  we  have 
from  (4.6.69) 

Now  for  C\^^KtC\^^  =  0,  if  we  take  h  =  0,  then  (4.6.65)  is  satisfied  and  we  can 
represent  h  by  (4-6.68).  ..     _ 

^^  Since  ef'{Ci  ')  is  an  equivariant  predictor  of  ^^(C<  ,  C(  '),  it  follows  from  lemma 
and  lemma  that  5(CJ^*)  is  an  equivariant  predictor  of  ^t(Cp\  CJ^')  under  the  group 
Q2  of  transformations  if  and  only  if  :h 

5(cP)  =  ef*(Cp))  +  (Ci^)'ii:,c!^))^s(p(ii:,Cr)))     ■.  (4.6.70) 

foralIcS^\  '■       '.  - 

Definition  An  equivariant  predictor  5o(C|^')  of  ^((C|^',  Cj^*)  is  said  to  be  a  best 
equivariant  predictor  under  the  loss  Z2  if  for  every  other  equivariant  predictor  f){C\    ) 
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E^[L2i^,,cf>;6)-L2i^,,cf>-,6o)] 

is  non  negative  definite. 

Remark.  Note  that  if  6o{C]  )  is  the  best  equivariant  predictor  of  ^tiC]  ,  Cj  ) 
under  the  loss  L2,  then  it  is  so  under  the  loss  L3  for  every  n.n.d.  J7  and  vice  versa. 

We  now  establish  that  ef'(C|^')  is  best  equivariant  predictor  of  ^((Cj  \C\  ') 
under  the  loss  L2. 

Theorem  4-6.2  Under  the  time  dom,ain  HB  model  given  in  and  the  family  of  ellipti- 
cally  symmetric  distributions  as  given  in  (4.6.58)  and  (4-6.59),  the  group  of  trans- 
formations Q2  given  in  (4.6.56),  and  the  loss  function  L2  given  in  (4.6.62),  the  best 

equivariant  predictor  of  ^f[C[    ,  C|    )  is  given  by  ef*{Cl    ). 

t  ■  . 

Proof  of  Theorem  4-6.2  Note  that  from  (4-6.70)  any  equivariant  predictor  6{Ci  )  of 
^t{Ct,Ct)  is  given  by 

6(cp))  =  ef  (cp')  +  (cr  ii:eCr^)F*(p(ii:.cp')). 

Only  those  6  are  considered  for  which  EA.{L2{^t,4>'i^))  is  finite  and  in  that  case  for 
the  corresponding  s.  we  have 


E. 


<t> 


'&^''k,C^^^s  [p{K,&^^))  s^  {p{K,C[^^)) 


is  finite.   Under  the  matrix  loss  L2, 
a'E^[L2{^„<j>;6)-{^„<p-ef*) 

=  E^[c\'^"K,C['^s{p{K,c\'^))s''ip{K,c['^)) 


+E, 


4> 


+E^ 


iefici'^)  -  ^,{C\'K  C?^)KC[^^'K,C['^)h^{p{K,C['^)) 


(C|^'^K.cP^)i.(p(i^.C|^'))(ef  (C!^')  -  ^,(Cl^  C!^))) 


(4.6.71) 


It  is  enough  to  prove  that  the  last  two  product  terms  in  (4-6.71)  are  null  matrices. 


T.-r-^  ■'.■--■. 'f^' 
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E^  [ef  (Cp))  -^,(Cp',ci^^)|Cp']  =  P,  [M,ci'^  -  E^{C?\C\'^)]   (4.6.72) 
Now  from  the  property  of  elliptically  symmetric  distributions,  it  follows  that 

and  using  this,  we  have  from  (4-6.72)  that 


Then,  the  second  term  of  the  rhs  of  (4-6.71) 


(4.6.73) 


«i-\  >i'^*i     _         JT- 


xE 


<i> 


{cJk,c,)Ha,^'^^e;,\&^^  -  A.(^)''i:-Up^a) 


.    XS^(p(iireCP^))] 


t   i 


(4.6.74) 


Now  we  will  show  that  the  expectation  of  the  rhs  of  (4.6.73)  is  a  null  matrix.  Note 
that  since  E^(C^^^^ KtC\^^)  <  oo)  and  %(Cp')  =  A^^a,  by  Cauchy-Schwarz 
inequality,  (C[^'  KtCt  )^At^^^  IT^/iAJ^^  has  finite  expectation.  Now  using  the  in- 
dependence of  {{C[^^^ KtC[^^)KAt^^^^ I!;i\C\^^)  and  p(KtC\^^),  we  have 


E 


4> 


(c/i^.c,)i(A.(^)^i;rilci^)  -  a,^^^^e:,\a^^^cc) 

xs^(p(/^,C<^)))] 

E^  [{C,^K,C,)'^AfE;,\{&^^  -  A^^a)] 

xE^  [s^(p(iir,cp)))] 


(4.6.75) 


Next  using  the  elliptical  symmetry  of  C'    ,  it  follows  that 
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Now  since  KtAy  =  0  and  Kt  is  n.n.d.,  one  has 

(Cr>  -  A\^^afK,{c\'^  -  Ap^a)]'  [Af  E-\iC['^  -  A^^c)] 
{-iC\'^-A^^a)YK,{-{C\^^-A^^a)} 

xAfi:;,\{-ic\^^-A[^^c.)}      :  ^'^ 

i.e.(CjKtCt))'^Af{C['^  -  a\'^cx)         , 


(4.6.76) 


Now  taking  expectations  on  both  sides  which  exist  finitely  due  to  the  previous  obser- 
vation, one  gets 


E 


<P 


iC'K^O^A'^  i:;,\{cY>-AY>a) 


W/-^(i)        ^(1). 


0 


(4.6.77) 


Hence  from  (4.6.71),  (4.6.74),  (4.6.75),  and  (4.6.77)  we  have 


a'E 


<t> 


L2{it,cl>:6)-{^,,4>;efn 


'4> 


.(1)^ 


t(1) 


(i)\\„r/ 


.(1)^ 


{CV  K,Cns(p{K,Cn)s'  {p{K,Cn) 


which  is  n.n.d.  for  all  (f>  and  the  two  risk  matrices  are  equal  if  and  only  if 
P^  [s{p(K,C\'^))  =  O]  =  1  forall  cj, 

i.e.,  P^  [6{C\'^)  =  ef'id'^)]  =  1  forall  <j> 

Remark.  Although  (Cf  "'/iT^Cp^)?,  A/i^^i;ri\c|^^  is  sufficient  and  p{KtC^^^) 
is  ancillary,  Basu's  theorem  can  not  be  apphed  since  the  sufficient  statistics  is  not 
complete. 


^:f:,:'    r  '."- :".:  r-  *  ."'«'**'     ^^      ':-.''>' 


CHAPTER  5 
SUMMARY  AND  FUTURE  RESEARCH 


5.1     Summary 

In  this  dissertation,  an  estimation  and  prediction  methodology  is  developed  for 
simultaneous  estimation  of  several  strata  means  at  different  time  points,  based  on  data 
from  repeated  surveys.  The  methodology  adopts  a  HB  model  which  incorporates  time 
dependent  random  components.  The  resulting  HB  estimators  for  a  small  area  "borrow 
strength"  over  time  in  addition  to  borrowing  strength  from  other  local  areas  as  has 
been  the  case  in  most  of  the  existing  methods  of  small  area  estimation.  The  Bayesian 
procedure  provides  current  as  well  as  retrospective  estimates  simultaneously  for  all 
the  small  areas.  Application  of  this  HB  procedure  to  the  median  income  estimation 
problem,  in  Chapters  2  and  3  has  demonstrated  the  usefulness  of  this  procedure  in 
providing  substantially  improved  estimators  for  small  areas. 

Implementation  of  the  HB  methodology,  in  finite  as  well  as  in  infinite  population 
sampling  situations  have  been  illustrated  by  adopting  a  Monto  Carlo  integration 
technique  known  as  the  Gibbs  sampler.  Using  this  procedure,  in  both  the  univariate 
and  the  multivariate  cases,  the  posterior  density  as  well  as  conditional  mean  and 
variance  can  be  obtained  with  considerable  ease. 

The  HB  predictors  of  a  vector  of  linear  functions  of  the  finite  population  obser- 
vations, in  the  special  case  of  known  ratios  of  variance  components,  are  shown  to 
be  best  unbiased  linear  predictors  without  any  distributional  assumptions.  It  is  also 
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shown  that  these  predictors  stochastically  dominate  all  the  linear  unbiased  predic- 
tors for  elliptically  symmetric  distributions  and  are  best  equivariant  predictors  under 
suitable  group  of  transformations. 

,      •  5.2     Future  Research 

The  HB  analysis  considered  in  this  dissertation  assumed  the  variance-covariance 
matrix  for  the  ba^ic  observations  as  known.  However,  often  in  practice,  only  the 
sample  variance-covariance  matrix  is  available.  We  can  extend  our  HB  model  by 
setting  either  proper  or  improper  priors  on  the  unknown  variance-covariance  matri- 
ces. We  also  assumed  that  the  process  through  which  the  random  components  evolve 
through  time  is  known.  The  case  when  this  process  is  unknown  needs  further  in- 
vestigation. Also,  the  application  of  general  technique  of  "borrowing  strength"  over 
time  can  be  explored  in  other  application  areas  e.g.  in  multicenter  clinical  trials  in 
biomedical  research  or  in  on-line  quality  assessment  of  different  production  lines  in  a 
manufacturing  industry. 

It  is  important  to  investigate  the  robustness  of  the  HB  procedure  considered  in 
this  dissertation,  to  handle  the  case  of  heavy  tailed  priors.  It  is  also,  worthwhile 
to  extend  our  Bayesian  model  to  allow  for  the  presence  of  outliers  in  the  data  by 
assuming  scale-contaminated  normal  distributions. 
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APPENDIX 
PROOF  OF  THEOREM  4.2.1. 


The  joint  pdf  of  Cj,  a,  R  and  A  is  given  by 


/(ct,a,r,A)ocr2(^'+E;=.'")-i 


i^ti" 


1  > 


X  exp 


1 
2 


(ct-Atafi:r^c,-A,a)  =    [a  -  {At^ U;' At)  'At'^E;'ct]^ 


where 

Q,  =  i:r' -  ^r'^t(A,^i:ru,)"'Afi:r^ 

Integrating  (1)  w.r.t  a,  the  joint  pdf  of  Cj,  i?  and  A  is  given  by 
/(C(,r,A)      oc      |^<r^|At^2;r'A,r^r2(^<+E,=i''<-p) 


X  eip 


~o  ("o  +  X!!"''^'  +  ct^QtCt) 


1=1 


i=i 


U,-i 


Integrating  (3)  w.r.t  R,  the  pdf  of  Ct,  and  A  is  given  by 

fict,X)     oc      \i:t\~'^\At'^I^;'At\~'[ao  +  ^a,Xi  +  Ct^Q,Ct)) 


(1) 


(2) 


(3) 


j(^«+e;=o'"-p) 


as 
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xfi^f''-'  (4) 

Now,  using  a  standard  formula  for  partitioned  matrices  (e.g.,  Searle  (1971),  page  46), 
we  have  ■<);  . 

X  (cp)  -  i:,2ii:nlcl^')  V    ■■'^  (5) 

Similarly 

X  (Ct     —  17<2i-S'jiiA(    ) 

'■■  ?hi;^  '^^   =    .r  +  4,      :'    •         [?V;    ^^  "■■  (6) 

" '   *'  ;■•■       '  ^        '■.    . ;  ' 

^  .^  ^     x(AP'-i:,2ii;;i\AS^')  (7) 

'w?s^7^.*        .......  .    .    ^  '^ 

From  (7)  ''  ' 

(a.(^)^i;-\aP')"  -  (A.(^)^i;-Ul^')"(Ar'  -  2:.2ii:nUr')'' 
X  (i:,22.i  +  {a\'^  -  SnxS7,\A^'^)  (A,(^)''i:-UPV 

,■        (AP'-i;.2.2;aU<^')7 

X  (aP'  -  i;.2ii:nl^r')(A/^>''i;- aPV' 

X  Gr^(^r  -  2:.ni:riUP^)(V;'''i:riUP')"' 

=   Mi+M2(^'a?/)  ,        ■    V  (8) 


-1 


-1 


Thus  /    '" 

=        (sf  +  s[)(Mi+M2)(Si+S2) 
=        sfMiSi  -  sfM2Si  +  S^MiS2 

-s[M2S2  +  2sf(Mi  -M2)S2  (9) 

sfMxSi      =     ci^^''(^-l  -  kM'^     .  '   -'■  :,-  .    ^,  ^  .  (10) 

s^M^s^    =    (M,ci''-i:,2ii;ri\cr^fGrnM,cp)-i;,2ii;rncr') 

S2M1S2        =        (C(       —  X'(21-S'fiiAt     )     (-S't'^2.1^*-^t22.1  ~  '^t22.l) 

X  (cp' -  i:,2i2;nUl'^)  ;  ..,     :  (11) 

S2^M2S2  =  (ci''-i:.2i^rnc|^')''(i:r2^2.iGti:r22.i-2-^r22.iGe-^) 

x(cP'-2;e2ii:rnCr^)  :  (12) 

sfMiS2    =    (M,cr>-i;,2ii:ruci'^)''i:r22.i(ci"-i^m2:rncr))  (13) 

Now  starting  with  the  joint  pdf  of  Cp\  a,  7?  and  A  we  get  the  pdf  of  Cp'  and   A 

given  by  .  ' 

ao  +  T,^'^'  +  c^^  K,c\'^)]  nV'"'        (15) 

/=i  /  /=i 

So  the  conditional  pdf  of  CJ      given   yl   =  A   is  given  by 

McPM  Cp'  =  cr\  A  =  A) 


Fk-rr^"- 


'  f   »  ^  ' 


u 


oc 


oc 


f{cu  A) 
McP',  A) 


\EA  VU     \A^''E;'At\ 


l^mi;    \,|A/^)''i:-iAP'| 


From  (7) 
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(16) 


\Ai'ET'A^\ 


j(2)  y^  y^-I     .(1)  ^ 

T 


I  (2) 


1    4(1)^ 


Et22.1  +  {At       —  Et2lEt^iAt     ) 


^  |-^t22.l| 


by  using  Exercise  2.4  on  page  32  of  Rao  (1973)  twice. 
Also 

|27t|     =     \Etii\  1^7(22. i| 
Using  the  definition  of  Gt,  (17)  and  (18),  we  have 


(17) 


(18) 


At^E;^At     =     At^^^' i:;,\A\ 


^^-^-1  aW 


\Gt\^ 


l^tu 


*       > 


s      5 


iir;p>'.''^7, 


i^el 


(1) 


So  the  conditional  pdf  of  C\  '  given  C\  '  aaid  A  is  given  by 


McPMci^A)     ex      \G,\-'na,  +  ±a,X,  +  c'^fK^c^^^) 


UNt-n,) 
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(19) 


/=i 


X  [1+    ao  +  E«A,  +  cp)V,crM     (cp^-M^c^))' 


/=i 


xGr^lcr^-M^c^))]-^^'''-'^'-''-''^ 


(20) 


Writing 


-1 


^iV,  +  E  /»'  -  PJ       (^«o  +  E  a/A,  +  c!^)  V.cr^)) 


Gt     =     ^t 


We  have 

"  /(cpMcr\A) 


1=0 


X   [(cP^-M,cr^)]-i(^'+E,=o'"-p)} 


(21) 


(1)    _    Ji) 


From  (21)  it  follows  that  the  conditional  distribution  of  C[^^  given  Cp^     =    c 
and  A    =    A    is  (A^^  —  n,)-variate  t  distribution  with  df  Ut  +  J2Uo  ^i  —  P  ■,  location 
parameter  M<C(     and  scale  parameter  4  ,.-,  ,  ■ .  ,. 


i^^v.cr))j 


Gt 


Also,  since  /(A  |c|^^)   =   ^^^^   oc   h{c[^\  A),  we  have  from  (15) 


/(c',') 


/(A|c[^))     a      \Emn\A,^'yi:-\A<i'^\~'l[\f 


kht-l 


1=1 


i-i 


T^ 
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